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CTCGAGACTC GGTGGAAGGG COCTTATCTC GTACTTTIGA CCACAGCAAC 50 (SEQ ID NO: 1 ) 

GQCIGTCAAA GICGAAQGAA TCHCCAOCTG GATCCATCCA TCOCAOGTTA 100 

AGCCGQCGCC AGCTOOCGAT TCQQGGTGGA AAGCOGAAAA GACIGAAAAT 150 

CCCCTTAAGC TTCGCCTCCA TQGCGTOGTT CCTTACTCTG TCAATAAOCT 200 

CICAGACTAA TCGTAIGGGC ATAGGAGACA GCCIGAAGTC CCATAAACCC 250 

TTATCTCICA CCIGGTTAAT TACTGACTCC GGCACAGGTA TTAATATCAA 300 

CAACACICAA GGGGAGGCTC CnTAGGAAC CIGGIGGCCT GATCTATACG . 350 

TITQCCTCAG ATCAGTTATT CCTAGTUIGA CCTCACCGCC AGATATOCTC 400 

CATCCTCACG GATTITAIGT TTGCCCAGGA CCACCAAATA ATCGAAAACA 450 

TIGCGGAAAT CCCAGAGATT TCTTTIGTAA ACAATGGAAC TGTCTAACCT 500 

CTAATGATGG ATATIGGAAA T3GCCAACCT CICAGCAGGA TAGGGTAAGT 550 

TTTIUITATG TCAACAGCTA TACCAGCTCT GGACAATTTA ATTACCTGAC 600 

CTGGATTAGA ACTGGAAGCC OCAAG1GCTC TOCTICAGAC CTAGATTACC 650 

TAAAAATAAG TTICACTGAG AAAQGAAAAC AAGAAAATAT CC™^AATGG 700 

GTAAATQGTA TGTCTIGGQG AATGGTATAT TATGGAGGCT CGGGTAAACA 750 

ACCAGGCTCC ATIUTAACTA TICGCCTCAA AATAAACCAG CIGGAGCCTC 800 

CAATGGCTAT AGGAGCAAAT AOQGTCTIGA GGGGTCAAAG AOCOCCAACC 850 

CAAGGACCAG GACCATCCTC TAACATAACT TCT3GATCAG ACCCCACIGA 900 

GTCTAGCAGC ACGACTAAAA TGGGGGCAAA ACI'lTl'l'AGC CTCATGCAGG 950 

GAGCTTTICA AGCTCTTAAC TQCAGGACTC CAGAGGCTAC CTUl'lUl'lGT 1000 

TGGCTATGCT TAGCTTIGGG COCACCTTAC TATGAAQGAA TGGCTAGAAG 1050 

AGGGAAATIC AATGTGACAA AAGAACATAG AGACCAATGC ACAT3GGGAT 1100 

CCCAAAATAA GCTTACGCTT ACIGAGGTIT CIGGAAAAGG CACCTOCATA 1150 

GGAAAGGTIC CCCCATOCCA CCAACACCTT TGTAACCACA CTCAAGCCTT 1200 

TAATCAAACC TCIGAAAGTC AATATCTGGT AOCIGGTTAT GACAGGTQGT 1250 

iGTAA TACTGGATTA AOCCCTK7IG TTIOCACCTT GGTTTTTAAC 1300 
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CAAACTAAAG A TTTHG CAT TATGGTCCAA ATIGTICCCC .GAGTGTATTA 1350 (SEQ ID NO: 1) cont'd 

CTATCCCGAA AAAGCAATOC TTGAT3AATA TCACTACAGA AATCATOGAC 1400 

AAAAGAGAGA ACCCATATCT CTCACACTIG CIGTGAT3CT OGGACTIQGA 1450 

GTOQCAGCAG GIGTAGGAAC AGGAACAGCT GCCCTQGTCA 03QGACCACA 1500 

GCAGCTAGAA ACAGGACTTA GTAACCTACA TQGAATIGTA ACAGAAGATC 1550 

TCCAAGCOCT AGAAAAATCT GTCAGTAACC TCGAGGAATC CCTAACCTOC 1600 

TTATCTGAAG TAGTCCTACA GAATAGAAGA GGGTTAGATT TATTATTTCT 1650 

AAAAGAAGGA GGATTAIGTG TAGCCTK3AA GGAQGAATOC TGTTTITA'IG 1700 

TCGATCATTC AGGG3CCATC AGAGACTOCA T3AACAAACT TAGAGAAAGG 1750 

TIGGAGAAGC GTCGAAQGGA AAAGGAAACT ACTCAAGGGT GGTTIGAGGG 1800 

ATGGTTCAAC AGGTCTCCTT GGTIGGCTAC CCTACTITCT GCTITAACAG 1850 

GACCCTTAAT AGTCTTCCIC CTCTTACTCA CAGTIGGGCC ATGTATTATT 1900 

AACAAGTTAA TIGCCTICAT TAGAGAACGA ATAAG1GCAG TCCAGATCAT 1950 

GGTACTTAGA CAACAGTACC AAAGCCCGTC TAGCAGGGAA GCIGGCCQCT 2000 

AGCTCTACCA GTICTAAGAT TAGAACTAIT AACAAGAGAA GAAGTGGGGA 2050 

A1GAAAGGAT GAAAATACAA CCTAAGCTAA TGAGAAGCTT AAAATIGTIC 2100 

TGAATTCCAG AGTITCTTCC TTATAGGTAA AAGATTAGGT TTITIGCTGT 2150 

TTTAAAATAT GCGGAAGTAA AATAGGCCCT GAGTACA1GT CTCTAGGCAT 2200 

GAAACTICTT GAAACTATTT GAGATAACAA GAAAAGGGAG TTTCTAACTG 2250 

CTIGTITAGC TICTGTAAAA CIGGTIGCGC CATAAAGATG TIGAAATGTT .2300 

GATACACATA TCTIGGTGAC AACATCTCTC CCCCACCCCG AAACATGCGC 2350 

AAATCIGTAA CTCTAAAACA ATTTAAATTA ATIGGTCCAC GAAGCGCGGG 2400 

CTCTCGAAGT TTTAAATIGA CTCGTTIGTG ATATTTIGAA ATGATTGGTT 2450 

TGTAAAQQQC GQGCTTIGCT GTCAACCCCA TAAAAGCTGT CCCGACTCCA 2500 

CACTOQQGQC CGCAGTQCTC TACCCCTGCG TGGIGTACGA CIGTGQGCCC 2550 
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CAGOGCGCTT QGAATAAAAA TQCTCTIGCT GTTIQCATCA AGACOGCTTC 2600 (SEQ ID NO: 1) con; 'd 

TCGTGAGTGA TTAAGGGGAG TOGCCITTIC CGAGCCTGGA QGTICTITIT 2650 

GCTGGTCTTA CATTIGGGGG CIOGT0CQGG ATCIGTOGCG GCCACCCCTA 2700 

ACACCCGAGA ACCGACTTOG AQGTAAAAAG GATCCIUTTT TTAACGIGTA 2750 

TGCAIGTACC GGCCQGCGTC TCTGTIUIGA GTGTCIGTTT TCAGT3GTCC 2800 

GOGCTTICQG TTIGCAGCTG TOCICTCAQG COGTAAGGGC T3GGGGACTG 2850 

TGATCAGCAG ACGTGCTAQG AGGATCACAG GCTGCTGCCC TGGGGGAOGC 2900 

CCCGQGAGGT GAGGAGAGCC AGGGAOGCCT GGTGGICTCC TACTCTCGGT 2950 

CAGAGGACOG AATTCTGTIG CTCAAGCGAA AGCTIGOCCC TCQGCGAQOG 3000 

TCCGACTGTT TIGCCIGCTT GIGGAATACG IGGACGGGTC ACGTCIGTCT 3050 

QGATCTGTIG GTTTCTGTTT T3TCTGTCTT TCIUITGTGT GTCCTTGTCT 3100 

ACAGTTTTAA TATCQGACAG AOQGTGACGA CCGCTCTTAG TTTGACICTC 3150 

GACCATIGGA CTGAAGTTAA ATCCAGGGCT CATAATTIGT CAGTICAGGT 3200 

TAAGAAGGGA CCTIGGCAGA CTITCTCIGT CIUIGAATGG GCGACATICG 3250 

ATCTK3GATG GCCATCAGAG GGGACCTTTA ATTCTGAGAT TATCCTGGCT 3300 

GTTAAAGCAA. TTATTTTICA GACTGGACCC GGCICICATC GGGATCAGGA 3350 

GCCCTATATC CITAOGTGGC AAGATTIGGC AGAGGATCCT CCGCCATGGG 3400 

TTAAACCATG GCIGAATAAG CCAAGAAAGC CAQGTCCCOG AATTCTGQCT 3450 

CTIGGAGAGA AAAACAAACA CTQGQCTCAA. AAAGTCAAGC OC1CTCCICA 3500 

TATCTACCCC GAGATTCAGG AAGCACQGGC TIGGGGGGAA GGCCAATCIG 3550 

TOXCOCACC COCTTATCIG GCACAQQGfIG CGGGGAQGGG ACCOTTOOC 3600 

CCIGCTGGAG CTOOGGCGGT GGAGGGACCT TCIGCAGQGA CTCGGAGCCG 3650 

GAGGGGCGCC ACCCCGGAGC GGACAGAOGA GATCGOGACA TTACCGCTCC 3700 

GCAGGTACQG CGOTOOCACA COQQQQGGCC AATIGCAGCC OCTCCAGTAT 3750 

TOGGCCri'lT CTTCTOCAGA TCICTATAAT TGGAAAACTA ACCATCCCCC 3800 
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TTICTOGGAG GATCCCCAAC GCCTCAOQGG GTIGGIGGAG TCOCTTAIGT 3850 (SEQ ID NO: 1) corXd 

TCTCTCACCA GCCTACTTCG GATGATTGTC AACAQCIGCT GCAGACACTC 3900 

TTCACAACOG AGGAQOGAGA GAGAArgPCTA TTAGAGGCTA GAAAAAATCT 3950 

TOCIQQQGQC GADGGGQGAC CCACX3CQGTT GCAAAATGAG ATTCACATGG 4000 . 

GATTTCCCTT AACICGOCCC GGfTIGGGACT ACAACACGQC TCAAQGTAQG 4050 

GAGAGCTTGA AAATCTATCG OCAQGCTCTG GTQGOQQGTC T0QQQGGO3C 4100 

CICAAGACQG CCCACTAATT TOGCTAAGGT AAGAGAAGTDG ATGCAQQGAC 4150 

CGAATGAACC CCCCICIGTT TITCTTGAGA GGCICTIGGA AGCCTTCAGG 4200 

CQGTACACCC CTITIGATCC CACCTCAGAG GCOCAAAAAG CCTCAGTGGC 4250 

TTTOGCCTIT ATAQGACAGT CAGOCTTOGA TATTAGAAAG AAGCTICAGA 4300 

GACIGGAAGG GTTACAQGAG GCT3AGTTAC GTGATCTAGT GAAGGAQGCA 4350 

GAGAAAGTAT ATEACAAAAG GGAGACAGAA GAAGAAAGQG AACAAAGAAA 4400 

AGAGAGAGAA AGAGAGGAAA GGGAGGAAAG ACGTAATAAA GGGCAAGAGA 4450 

AGAATTIGAC TAAGATCTIG GCTGCAGTGG TIGAA3GGAA AAGCAATACG 4500 

GAAAGAGAGA GAGATTTTAG GAAAATTAQG TCAGGOCCTA GACACTCAGG 4550 

GAACCIGGGC AATAGGACCC CACTQGACAA GGACCAA1GT GCATATIGTA 4600 

AAGAAAGAGG ACACIQQGCA AGGAACIGOC CCAAGAAGQG AAACAAAGGA 4650 

CCAAGGATCC TAGCTCTAGA AGAAGATAAA GATTA3GGGA GAOGGGGTTC 4700 

GGACCOCCTC CCQGAGGCCA GGGTAACTIT GAAQGTGGAG GQGCAACCAG 4750 

TTGAGTICCT GGTIGATACC GGAQCGAAAC ATICAGIGCT ACTACAGCCA 4800 

TTAQGAAAAC TAAAAGATAA AAAATOCTQG GIGAT3GGTG CACAQQGCAA 4850 

CAACAGTATC CAIQGACTAC CQGAAGACAG TTGACTIGGG AGTQQGACQG 4900 

GTAACCCACT CXHTIUIGGT CATACCIGAG TGCCCAGCAC COCTCTTAGG 4950 

TAGAGACTTA TIGACCAAGA TGGGAGCACA AATTICTTIT GAACAAGGGA 5000 

AACCAGAAGT GTCIGCAAAT AACAAACCTA TCACTOIGTT GACCCTCCAA 5050 
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TTAGATGAOG AATATOGACT ATACTCTOGC CTAGTAAAGC CIGATCAAAA 
TATACAATIC TGGTK3GAAC AGTTTCCCCA AGCCIQQGCA GAAACCGCAG 
GGATGGGTTT GGCAAAGCAA GTTCCCCCAC AAGTTATTCA ACTGAAQGCC 
AGTOCCACAC CAGTCTCAGT CAGACAGTAC CCCTIGAGTA AAGAAQCTCA 
AGAAQGAATT GGGCOGCATG TCCAAAGATT AATCCAACAG GQCATQCTAG 
TICCTGTCCA ATCTQOCTGG AATACTCOOC TQCTACCX3GT TAGAAAGCCT 
.GGGACTAATG ACTATCGACC AGTACAGGAC TIGAGAGAGG TCAATAAACG 
GGTGCAGGAT ATACACOCAA CAGTCCQGAA CXXTTATAAC CTCTTGTGIG 
CIXTPXCACC CCAACGGAGC TOGTATACAG TATIQGACTT AAAGGATCCC 
TK7JTDCTGCX: TGAGATTACA CCCCACTAQC CAACCACTIT TIGOC7ITOGA 
A1GGAGAGAT CCAGGTACX3G GAAGAACOQG GCAGCTCAOC TC5GACQCGAC 
TGCOOCAAGG GTICAAGAAC TCCCCGACCA TCTTIGACGA AGCCCTACAC 
AGAGAOCTQG (XAACTICAG GATCCAACAC CCTCAGGIGA COCTCCTCCA 
GTAQGTGGAT GACCIGCITC T9GCQGGAQC CACCAAACAG GAC1GCTTAG 
AAGGCACGAA GGCACTACTG CTGGAATIGT CTGACCTAGG CTACAGAGCC 
TCIGCTAAGA AGGCCCAGAT TIGCAGGAGA GAGGTAACAT ACTIGGGGTA 
CAGTTTAQQG GACGGGCAGC GATCGCIGAC GGAGGCAOGG AAGAAAACIG 
TAGTCCAGAT ACCGGCOQCA ACCACAGCCA AACAAATGAG AGAGl'iTl'lG 
GQGACAGCIG GATJ.T1UCAG ACTGTOGATC CCQQGGTTIG CX3AC3CTIAGC 
AGCOXACTC TACCCQCTAA CCAAAGAAAA AQQQGAATIC TXTQGGCTC 
CIGAGCAOCA GAAQQCATIT GATCCTATCA AAAAGQCCCT GCTGAGOGCA 
CCTOCTCTOG CCCTCOCIGA CGTAACTAAA OCCTTTACCC TITATGTQGA 
TGAGOGTAAG GGAGTAGCCC GQGGAGTTIT AACCCAAACC CTAGGACCAT 
QGAGAAGACC TCTCX3CCTAC CTGTCAAAGA AGCTC3ATCC 1GTAGCCAGT 
GGTIQGCOCA TATGCCTGAA GQCTATCGCA GCIGIQQOCA TACIGGTCAA 
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QGACGCTGAC AAATIGACTT TOQGACAAGA ATATAACIGT AATAGCCOCC 6350 (SEQ ID NO: 1) cont'd 

CATOCATTGG AGAACATCGT TC3GCAGCCC CCAGACQGAT QGATGACCAA 6400 

CGCCCGCATC ACCCACTATC AAAGCCTGCT TCICACAGAG AGGGTCAQGT 6450 

TCGCTCCACC AACCX3CIUIC AACCCK3CCA CTCTICTQCC TGAAGAGACT 6500 

GATCAACCAG TGACICAIGA TIGCCATCAA CTATIGATIG AGGAGACIGG 6550 

GGTCCGCAAG GACCTTACAG ACATACCGCT GACTGGAGAA GTGCTAACCT 6600 

QGTICACIGA CGGAAGCAGC TATGflGGTQG AAGGTAAGAG GATGGCIQQG 6650 

GCGGCGGTQG TQGACGQGAC COGCAOGATC TQQGOCAGCA GCCTGCOGGG 6700 

AGGAACTTCA GCACAAAAGG CTGAGCTCAT QGCCCTCACG CAAQCTTTGC 6750 

GGCIQGCCGA AGGGAAATCC ATAAACATTT ATAOGGACAG CAGGTATOCC 6800 

TITOQGACTG CACAGGTACA TOQGGCCATC TATAAACAAA GGGGGTIGCT 6850 

TACCTCAGCA GGGAGGGAAA TAAAGAACAA AGAQGAAATT CTAAGCCTAT 6900 

TAGAAGCOGT AC^TITACCA AAAAGGCTAG CTATTATACA CTCICCTQGA 6950 

CATCAGAAAG CTAAAGATCT CATATGCAGA GGAAACCAGA 1GGCIGAGCG 7000 

QGTIGCCAAG CAGGCAGCCC AGGGTGTTAA CCTIUIQGCT ATAATAGAAA 7050 

TGCCCAAAGC GCCAGAACCC AGACGACAGT ACACOCTAGA AGACTGGCAA 7100 

GAGATAAAAA AGATAGACCA TIC7IUIGAGA CTCCQGAAGG GACCIGCTAT 7150 

ACCTCAGATG GGAAQGAAAT CCTGCGCCAC AAAGAAGGGT TAGAATATGT 7200 

CCAACAAGAT ACATGGTCTA AGCCAGCTAG GAACTAAACA GCTQCAGCAG 7250 

TIGGTCAGAA CATCCCCTTA TCATGTTCIG AGGCTAOCAG GAGTGGCIGA 7300 

CTCQGTGGTC AAACATIGTG TOCCCT3CCA GCIGGTTAAT GCTAATCCTT 7350 

CCAGAATQCC TOCAGQGAAG AGACTAAQQG GAAGCCACCC AGQCGCTCAC 7400 

TGGGAAGTOG ACTTCACTGA GGTAAAGCGG GCTAAATATG GAAACAAATA 7450 

OCTATTGGTT TTIGTAGACA CCTTTICAGG ATOGGTAGAG GCTTATCCTA 7500 

CTAAGAAAGA GACTTCAACC GTQGTAGCTA AAAAAATACT GGAAGAAATT 7550 
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TITOCAAGAT TTQGAATACC TAAGGTAATA GQGTCAGACA ATGGTOCAGC 7600 (SEQ ID NO: 1) cont'd 

TTTIGTTGCC CAQGTAAGTC AQGGACTCGC CAAGATATIG GGGATIGATT 7650 

GGAAACTGCA TIGTGCATAC AGACCCCAAA GCTCAGGACA GGTAGAGAGG 7700 

ATGAATAGAA CCATTAAAGA GACCCITACT AAATTCACOG OGGAGACIGG 7750 

CGTTAATCAT TOGATAGCTC TCCYGCCCYT TCIGCTTTTT AQQGTTAGGA 7800 

ACACCGCIGG ACAGTTIGGG CTCACCCCCT ATCAATTACT CTAQGGGQGA 7850 

CCXXTQCCAT TQGTAGAAAT TOCTIUIGTA CATAGTGCIG ATCTOCTQCT 7900 

TIGOCAGCCT TK3TTCTCIA GQCTCAAQQC ACTIGAGTOG GTCAGACAAC 7950 

GAQGGTQGAG GCAACTCCX3G GAGGCCTACT CAGGAGGAQG AGACTIGCAG 8000 

ATCXXACATC GTITCCAAGT GGGAGATTCA GTCTAOGrTA GACGCCAOOG 8050 

TGCAGGAAAC 8060 
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io 20 30 40. 50 60 (SEQ ID NO: 2) 

* ★ * * * * * * * * * ★ 

CTACCCCTGC GTQGTCTACG ACTCTOGGCC CCAGCGCGCP TQGAATAAAA ATCCTCTTCC 

70 80 90 100 110 120 

* * * * * * * * * * * * 

TGTTTOCATC AAGACCGCTT CITCTCAGTC ATTTGGGGTG TCGCCTCTTC OGAGCCCGGA 

130 140 150 160 170 180 

* • * * * * * * * * * * * 

CGAGGGGGAT TGTTCTTTTA CTQQCCTTIC ATTTCGTQCG TIGGCCGGGA AATCCTCOGA 

190 200 210 220 230 240 

* * * * * * * * + * * * 

CCACCCCTTA CACCCGAGAA COGACTTOGA GGTAAAGGGA TCCCCTTTGG AACATATGTC 

250 260 270 280 290 300 

* * ** * * * * * * *★ 

TCTGTQGGCC GGOGTCTCTG TTCTGAGTCT CTGTTTTOGG TGATGCGCGC TTICGGTTTG 

310 320 330 340 350 360 

** * + ** * * ★* ★* 

CAGCTGTCCT CTCAGACCGT AAGGACTGGA GGACTGTCAT CAGCAGACGT GCTAGGAGGA 

370 380 390 400 410 420 

** ** ** ** ★* * * 

TC^^^GCTG CCACCCTGGG QGACGCCCCG GGAGGTGGGG AGAGCCAGGG ACGCCTQGTG 

430 440 450 460 470 480 

************ 

GTCTCCTACT GTOGGTCAGA GGACCGAGTT CTGTTGTTGA AGOGAAAGCT TCCCCCTCCG 

490 500 510 520 530 540 

** ** ** ** ** ** 

CGGCCGTCCG ACTCTTTT3C CTGCITCTOG AAGACGCGGA CGGGTCGCGT GTGTCTGGAT 

550 560 570 580 590 600 

** ** ** ** ** * * 

CIGTTQGTTT CTCTTTCGTG TGTCTTTGTC TTGTGCGTCC TTGTCTACAG TTTTAAT ATG 

Met> 

610 620 630 640 



* 




* 


* 




* 


* 


* * 




* * 




GGA CAG 
Gly Gin 


ACA 
Thr 


GTG 
Val 


ACT 
Thr 


ACC 
Thr 


CCC CTT 
Pro Leu 


ACT 
Ser 


TTC ACT CTC 
Leu Thr Leu 


GAC 
Asp 


CAT TOG 
His Trp 


ACT 
Thr> 


650 
* 


* 


660 
* 




* 


670 
★ 


* 


680 
* 


* 


690 
* 


* 


GAA GTT 
Glu Val 


AGA 
Arg 


TCC 
Ser 


AGG 
Arg 


GCT 
Ala 


CAT AAT 
His Asn 


TTC 
Leu 


TCA GTT CAG 
Ser Val Gin 


GTT 
Val 


AAG AAG 
Lys Lys 


GGA 
Gly> 


700 
* 


* 


710 
* 




720 
* * 




730 
* * 


* 


740 
* 




CCT TGG 
Pro Trp 


CAG 
Gin 


ACT 
Thr 


TTC 
Phe 


TGT 
Cys 


GCC TCT 
Ala Ser 


GAA 
Glu 


TGG CCA ACA 
Trp Pro Thr 


TTC 
Phe 


GAT GTT 
Asp Val 


GGA 
Gly> 



FIGURE 2 



Page 9 of 34 

MOLECULAR SEQUENCE OF SWINE RETROVIRUS AND METHODS OF USE 















Jay A. Fishman 






















14846-0U004 












750 






760 




770 






780 




790 


* 


* 






* 


* 




* 




* 


* 


* 






CCA TCA GAG 


GGG ACC 


TTT 


AAT 


TCT GAA 


ATT 


ATC CTC GCT 


GTT AAG 


Trp 


Pro Ser Glu 


Gly Thr Phe 


Asn 


Ser Glu 


He 


He Leu, Ala 


Val Lys> 




800 




810 






820 




830 


840 


★ 




* 




* * 




★ 




★ 


* 










ATC 


ATT TTT 


CAG ACT GGA 


CCC 


GGC 


TCT 


CAT 


CCT 


GAT CAG 


GAG CCC 


Ala 


He 


He 


Phe 


Gin Thr Gly 


Pro 


Gly 


Ser 


His 


Pro 


Asp Gin 


Glu Pro> 






850 


860 






870 






880 






* 






* 


★ 




★ 


* 




* 


★ 




TAT 


ATC 


CTT 


ACG 


TOG CAA 


GAT 


TTC 


GCA 


GAA 


GAT 


CCT 


CCG CCA 


TOG GTT 


Tyr 


He 


Leu 


Thr 


Trp Gin 


Asp 


Leu 


Ala 


Glu 


Asp 


Pro 


Pro Pro 


Trp Val> 


890 






900 




910 




920 




930 




★ 




* 


★ 


★ 




* 


★ 




* 




★ * 




AAA 


CCA 


TOG 


CTA 


AAT AAA 


CCA 


AGA 


AAG 


CCA 


GGT 


CCC 


OGA ATC 


CTC GCT 


Lys 


Pro 


Trp 


Leu 


Asn Lys 


Pro 


Arg 


Lys 


Pro 


Gly 


Pro 


Arg He 


Leu Ala> 


940 




950 




960 






970 


980 




★ 


* 




* 


★ 






* 




★ 




* 


CTT 


GGA 


GAG 


AAA 


AAC AAA 


CAC 


TCG 


GCC 


GAA 


AAA GTC 


GAG CCC 


TCT CCT 


Leu 


Gly 


Glu 


Lys 


Asn Lys 


His 


Ser 


Ala 


Glu 


Lys Val 


Glu Pro 


Ser Pro 




990 






1000 




1010 




1020 




1030 


* 


* 




* 


★ 


* 




* 




* 


★ 


* 


* 


OCT 


ATC 


TAC 


CCC 


GAG ATC 


GAG 


GAG 


COG 


COG 


ACT 


TOG 


CCG GAA 


CCC CAA 


Arg 


He 


Tyr 


Pro 


Glu He 


Glu 


Glu 


Pro 


Pro 


Thr 


Trp 


Pro Glu 


Pro Gln> 




" 1040 




1050 






1060 




1070 


1080 


★ 




* 




* * 




★ 




* 


★ 




-*- 


* ★ 


CCT 


GTT 


CCC 


CCA 


CCC CCT 


TAT 


CCA 


GCA CAG 


GGT 


GCT 


GTC AGG 


GGA CCC 


Pro 


Val 


Pro 


Pro 


Pro Pro 


Tyr 


Pro 


Ala Gin 


Gly 


Ala 


Val Arg 


Gly Pro 






1090 


1100 




1110 






, 1120 






★ 




★ 


* 


★ 




* 


* 




★ 


* 


* 


TCT 


GCC 


CCT CCT 


GGA GCT 


CCG 


GTG 


GTG 


GAG 


GGA 


CCT 


GCT GCC 


GOG ACT 


Ser 


Ala 


Pro 


Pro 


Gly Ala 


Pro 


Val 


Val 


Glu 


Gly 


Pro 


Ala Ala 


Gly Thr> 


L130 




1140 




1150 




1160 




1170 




* 




* 


* 


★ 




★ 


★ 




* 




* ★ 


* 


CGG 


AGC 


OGG 


AGA 


GGC GCC 


ACC CCG 


GAG 


CGG 


ACA 


GAC 


GAG ATC 


GOG ATA 


Arg 


Ser 


Arg 


Arg 


Gly Ala 


Thr 


Pro 


Glu 


Arg 


Thr 


Asp 


Glu He 


Ala Ile> 


1180 




1190 


1200 






1210 


1220 




★ 


★ 




★ 


* 


* 




* 




* 


★ 


* 


TTA CCG 


CTG 


COC 


ACC TAT 


GGC 


CCT 


CCC 


ATC 


OCA GGG 


GGC CAA 


TTC CAG 


Leu 


Pro 


Leu 


Arg 


Thr Tyr 


Gly 


Pro 


Pro 


Met 


Pro Gly 


Gly Gin 


Leu Gln> 


1230 






1240 




1250 




1260 




1270 


* 


* 




* 


* 


★ 




★ 




* 


* 


+ 


★ 


COC CTC 


CAG 


TAT 


TOG CCC 


TTT 


TCT TCT 


GCA 


GAT CTC 


TAT AAT 


TOG AAA 


Pro 


Leu 


Gin 


Tyr 


Trp Pro 


Phe 


Ser 


Ser 


Ala 


Asp Leu 


Tyr Asn 


Trp Lys> 




1280 




1290 






1300 




1310 


1320 


* 








* * 




* 




★ 


* 




* 


★ * 


ACT AAC CAT 


CCC 


CCT TTC 


TOG 


GAG GAT CCC 


CAA CGC CTC AOG 


GGG TTC 


Thr Asn His 


Pro 


Pro Phe 


Ser 


Glu Asp Pro 


Gin Arg Leu Thr 


Gly Leu> 



(SEQ ID NO: 2) cont'd 
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1330 1340 1350 

* * * ★ * ★ 

GTG GAG TCC CTT ATG TIC TCT CAC CAG CCT ACT 
Val Glu Ser Leu Met Phe Ser His Gin Pro Thr 

1370 1380 1390 1400 

* * * * * ★ * 

CAG CTG CTG CAG ACA CTC TTC ACA ACC GAG GAG 
Gin Leu Leu Gin Thr Leu Phe Thr Thr Glu Glu 



* * . (SEQ ID NO: 2) cont'd 

TOG GAT GAT 7G7 CAA 
Trp Asp Asp Cys Gln> 

1410 
* ★ * 

CGA GAG AGA AIT CTG 
Arg Glu Arg lie Leu> 



1420 1430 1440 1450 1460 

★ * * *★ * * * ★ 

TTA GAG GCT AAA AAA AAT GTT CCT GGG GCC GAC GGG CGA CCC ACG CAG 
Leu Glu Ala Lys Lys Asn Val Pro Gly Ala Asp Gly Arg Pro Thr Gln> 

1470 1480 1490 1500 1510 

* * * * * * * * * * 

TTG CAA AAT GAG ATT GAC ATG GGA ITT CCC TIG ACT OGC CCC GOT TGG 
Leu Gin Asn Glu He Asp Met Gly Phe Pro Leu Thr Arg Pro Gly Trp> 



1520 1530 
* * * ★ * 

GAC TAC AAC AOG GCT GAA GGT AGG 
Asp Tyr Asn Thr Ala Glu Gly Arg 



1540 1550 1560 

* * * * *• 

GAG AGC TTG AAA ATC TAT CGC CAG 
Glu Ser Leu Lys He Tyr Arg Gln> 



1570 1580 1590 1600 

* * * * ★ X ft * 

GCT CTG GIG GOG GGT CTC GGG GGC GCC TCA AGA OG3 CCC ACT AAT TTG 
Ala Leu Val Ala Gly Leu Arg Gly Ala Ser Arg Arg Pro Thr Asn Leu> 



1610 1620 1630 1640 1650 



* * 


* * 


★ 


★ 


* 


* 


★ 


« * 


GCT AAG GTA 


AGA GAG GTC 


ATG CAG 


GGA CCG 


AAC GAA 


CCT 


CCC 


TOG GTA 


Ala Lys Val 


Arg Glu Val 


Met Gin 


Gly Pro 


Asn Glu 


Pro 


Pro 


Ser Val> 


1660 


1670 


1680 




1690 




1700 


* * 


★ 


* * 


* 


* 


* 




* 


TTT CTT GAG 


AGG CTC ATG 


GAA GCC 


TTC AGG 


OGG TTC 


ACC 


CCT 


TTT GAT 


Phe Leu Glu 


Arg Leu Met 


Glu Ala 


Phe Arg 


Arg Phe 


Thr 


Pro 


Phe Asp> 


1710 


1720 


1730 


1740 






1750 


★ * 


* * 


★ 


* 


* * 




* 




CCT ACC TCA 


GAG GCC CAG 


AAA GCC 


TCA GTG 


GCC CTG 


GCC 


TTC 


ATT GGG 


Pro Thr Ser 


Glu Ala Gin 


Lys Ala 


Ser Val 


Ala Leu 


Ala 


Phe 


He Gly> 


1760 


1770 




1780 


1790 




1800 


* * 


* * 


* 


* 


★ 


* 




* * 


CAG TOG GCT 


CTG GAT ATC 


AGG AAG 


AAA CTT 


CAG AGA 


CTG 


GAA 


GGG TTA 


Gin Ser Ala 


Leu Asp He 


Arg Lys 


Lys Leu 


Gin Arg 


Leu 


Glu 


Gly Leu> 



1810 1820 1830 1840 

* * * * * it * ** 

CAG GAG GCT GAG TTA OGT GAT CTA GTG AGA GAG GCA GAG AAG GTG TAT 
Gin Glu Ala Glu Leu Arg Asp Leu Val Arg Glu Ala Glu Lys Val Tyr> 



1850 1860 1870 1880 1890 

* * * * * ★ * * * * 

TAC AGA AGG GAG ACA GAA GAG GAG AAG GAA CAG AGA AAA GAA AAG GAG 
Tyr Arg Arg Glu Thr Glu Glu Glu Lys Glu Gin Arg Lys Glu Lys Glu> 
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1900 1910 1920 1930 1940 

* * * ***** * 

AGA GAA GAA AGG GAG GAA AGA OGT GAT AGA CGG CAA GAG AAG AAT TIG 
Arg Glu Glu Arg Glu Glu Arg Arg Asp Arg Arg Gin Glu Lys Asn Leu> 

1950 1960 1970 1980 1990 

********** 

ACT AAG ATC TIG GCC GCA GTG GTT GAA GGG AAG AGC AGC AGG GAG AGA 
Thr Lys He Leu Ala Ala Val Val Glu Gly Lys Ser Ser Arg Glu Arg> 

2000 2010 2020 2030 2040 

********** 

GAG AGA GAT TTT AGG AAA ATT AGG TCA GGC OCT AGA CAG TCA GGG AAC 
Glu Arg Asp Phe Arg Lys He Arg Ser Gly Pro Arg Gin Ser Gly Asn> 

2050 2060 2070 2080 

* ** * ** * ** 

CTG GGC AAT AGG ACC CCA CTC GAC AAG GAC CAG TGT GOG TAT TGT AAA 
Leu Gly Asn Arg Thr Pro Leu Asp Lys Asp Gin Cys Ala Tyr Cys Lys> 

2090 2100 2110 2120 2130 

*** ****** * 

GAA AAA GGA CAC TGG GCA AGG AAC TQC CCC AAG AAG GGA AAC AAA GGA 
Glu Lys Gly His Trp Ala Arg Asn Cys Pro Lys Lys Gly Asn Lys Gly> 

2140 2150 2160 2170 2180 

* i. * i •* * *** 

COG AAG GTC CTA GCT CTA GAA GAA GAT AAA GAT T AGGQGAGACG 
Pro Lys Val Leu Ala Leu Glu Glu Asp Lys Asp> 

2190 2200 2210 2220 2230 2240 

*★ ** ** ** ** ** 

■ GGGTTCGGAC CCCCTCCCOG AGCCCAGGGT AACTTTGAAG GTGGAGGGGC AACCAGTTGA 

2250 2260 2270 2280 2290 2300 

** ** ** ** ** ** 

GTTCCTQGTT GATACOGGAG CGGAGCATTC AGTGCTCCTA CAACCATTAG GAAAACTAAA 

2310 2320 2330 2340 2350 

* **** * ** * * 

AGAAAAAAAA TCCTGQGTC ATG GGT GOC ACA GGG CAA CGG CAG TAT CCA TOG 
Met Gly Ala Thr Gly Gin Arg Gin Tyr Pro Trp> 

2360 2370 2380 2390 2400 

** ******** 

ACT ACC OGA AGA ACC GIT GAC TTG GGA GTG GGA CGG CTA ACC CAC TOG 
Thr Thr Arg Arg Thr Val Asp Leu Gly Val Gly Arg Val Thr His Ser> 

2410 2420 2430 2440 

* ** * ** * ** 

TTT CTG GTC ATC OCT GAG TGC OCA GTA CCC CTT CTA GGT AGA GAC TTA 
Phe Leu Val He Pro Glu Cys Pro Val Pro Leu Leu Gly Arg Asp Leu> 

2450 2460 2470 2480 2490 

*** ****** * 

CTG ACC AAG ATG GGA GCT CAA ATT TCT TTT GAA CAA GGA AGA CCA GAA 
Leu Thr Lys Met Gly Ala Gin He Ser Phe Glu Gin Gly Arg Pro Glu> 



(SEQ ID NO: 2) cont'd 
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2500 2510 2520 2530 2540 (SEQ ID NO: 2) cont'd 

********* 
GTC TCT GIG AAT AAC AAA CCC ATC ACT GTC TTC ACC CTC CAA TTA GAT 
Val Ser Val Asn Asn Lys Pro He Thr Val Leu Thr Leu Gin Leu Asp> 

2550 2560 2570 2580 2590 

** * ** ★ ** * * 

GAT GAA TAT CGA CTA TAT TCT CCC CAA GTA AAG CCT GAT CAA GAT ATA 
Asp Glu Tyr Arg Leu Tyr Ser Pro Gin Val Lys Pro Asp Gin Asp Ile> 

2600 2610 2620 2630 2640 

********** 

CAG TCC TGG TTG GAG CAG TTT CCC CAA GCC TOG GCA GAA ACC GCA GQG 
Gin Ser Trp Leu Glu Gin Phe Pro Gin Ala Trp Ala Glu Thr Ala Gly> 

2650 2660 2670 2680 

* ** * ** * * * 

ATC GGT TTG GCA AAG CAA GTT CCC CCA CAG GTT ATT CAA CTG AAG GCC 
Met Gly Leu Ala Lys Gin Val Pro Pro Gin Val He Gin Leu Lys Ala> 

2690 2700 2710 2720 2730 

********** 

AGT GCT ACA CCA GTA TCA GTC AGA CAG TAC CCC TTG AGT AGA GAG GCT 
Ser Ala Thr Pro Val Ser Val Arg Gin Tyr Pro Leu Ser Arg Glu Ala> 

2740 2750 2760 2770 2780 

** * ** ■* ** ★ 

CGA GAA GGA ATT TGG COG CAT GTT CAA AGA TTA ATC CAA CAG GGC ATC 
Arg Glu Gly He Trp Pro His Val Gin Arg Leu He Gin Gin Gly Ile> 

2790 2800 2810 2820 2830 

** * ** * * * * * 

CTA GTT CCT GTC CAA TCC CCT TGG AAT ACT CCC CTG CTA COG GTT AGG 
Leu Val Pro Val Gin Ser Pro Trp Asn Thr Pro Leu Leu Pro Val Arg> 

2840 2850 2860 2870 2880 

******* * ** 

AAG CCT GGG ACC AAT GAT TAT OGA CCA GTA CAG GAC TTG AGA GAG GTC 
Lys Pro Gly Thr Asn Asp Tyr Arg Pro Val Gin Asp Leu Arg Glu Val> 

2890 2900 2910 2920 

* ** * ** * ** 

AAT AAA AGG GTC CAG GAC ATA CAC CCA AGG GTC COG AAC OCT TAT AAC 
Asn Lys Arg Val Gin Asp He His Pro Thr Val Pro Asn Pro Tyr Asn> 

2930 2940 2950 2960 2970 

* ** * *★ * ** * 

CTC TTG AGC GCC CTC COG CCT GAA COG AAC TGG TAC ACA GTA TTC GAC 
Leu Leu Ser Ala Leu Pro Pro Glu Arg Asn Trp Tyr Thr Val Leu Asp> 

2980 2990 3000 3010 3020 

********* 

TTA AAA GAT GCC TTC TTC TCC CTG AGA TTA CAC CCC ACT AGC CAA CCA 
Leu Lys Asp Ala Phe Phe Cys Leu Arg Leu His Pro Thr Ser Gin Pro> 

3030 3040 3050 3060 3070 

******** ** 

CTT TTT ACC TTC GAA TOG AGA GAT CCA GGT ACG GGA AGA ACC GGG CAG 
Leu Phe Thr Phe Glu Trp Arg Asp Pro Gly Thr Gly Arg Thr Gly Gln> 
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3080 3090 3100 3110 3120 (SEQ ID NO: 2) cont'd 



* * * ★ 


★ 




* 


* 


* 


* 


* 


CTC ACC TGG ACC CGA CTG CCC CAA 
Leu Tlir Trp Thr Arg Leu Pro Gin 


GGG TTC 
Gly Phe 


AAG AAC 

Lys Asn 


TCC COG 
Ser Pro 


ACC 
Thr 


ATC 
Ile> 


3130 3140 
★ * * * 


* 


3150 
* 


* 


3160 

*. 


★ 




TTT GAC GAA GCC CTA CAC 
Phe Asp Glu Ala Leu His 


AGG GAC 
Arg Asp 


CTG 
Leu 


GCC 
Ala 


AAC TTC 
Asn Phe 


AGG ATC 
Arg lie 


CAA 
Gin 


CAC 
His> 


3170 3180 

* * ★ * 


3190 
* 


* 


3200 
* 


3210 
★ * 




* 


CCT CAG GTG ACC CTC CTC 
Pro Gin Val Thr Leu Leu 


CAG TAC 
Gin Tyr 


GTG 
Val 


GAT 
Asp 


GAC CTG 
Asp Leu 


CTT CTG 
Leu Leu 


GOG 
Ala 


GGA 
Gly> 


3220 3230 
* * ★ 


3240 
★ * 




* 


3250 
* 


3260 
* * 




GCC ACC AAA CAG GAC TGC 
Ala Thr Lys Gin Asp Cys 


TTA GAA 
Leu Glu 


GGT 
Gly 


AOG 
Thr 


AAG GCA 
Lys Ala 


CTA CTG 
Leu Leu 


CTG 
Leu 


GAA 
Glu> 


3270 3280 
★ * * * 


3290 
* * 




3300 
* * 


* 


3310 
★ 


TTC TCT GAC CTA GGC TAG 
Leu Ser Asp Leu Gly Tyr 


AGA GCC 
Arg Ala 


TCT 
Ser 


GCT 
Ala 


AAG AAG 
Lys Lys 


GCC CAG 
Ala Gin 


ATT TGC 
lie Cys> 


3320 3330 
* ★ * * 


★ 


3340 
* 


3350 
* * 


3360 
* * 


AGG AGA GAG GTA ACA TAC 
Arg Arg Glu Val Thr Tyr 


TTG GGG 
Leu Gly 


TAC AGT 
Tyr Ser 


TTG GGG 
Leu Arg 


GGC GGG 
Gly Gly 


CAG 
Gin 


OGA 
Arg> 


3370 3380 
* * * * 


3390 
* ★ 


* 


3400 
* 


* 




TGG CTG ACG GAG GCA COG 
Trp Leu Thr Glu Ala Arg 


AAG AAA 
Lys Lys 


ACT 
Thr 


GTA 
Val 


GTC CAG 
Val Gin 


ATA COG 
lie Pro 


GCC 
Ala 


OCA 
Pro> 


3410 3420 

* * ★ * 


3430 
* 


* 


3440 
* 


3450 
* ★ 




* 


ACC ACA GCC AAA CAA GTG 
Thr Thr Ala Lys Gin Val 


AGA GAG 
Arg Glu 


TTT 
Phe 


TTG 
Leu 


GGG ACA 
Gly Thr 


GCT GGA 
Ala Gly 


TTT 

Phe 


TGC 
Cys> 


3460 ' 3470 
* * * 


3480 
* * 




★ 


3490 
* 


3500 

* * 




AGA CTG TGG ATC CCG GGG 
Arg Leu Trp lie Pro Gly 


TTT GOG 
Phe Ala 


ACC 
Thr 


TTA 
Leu 


GCA GCC 
Ala Ala 


OCA CTC TAC 
Pro Leu Tyr 


COG 
Pro> 


3510 3520 
* * * * 


3530 
* * 




3540 
* * 


★ 


3550 
* 


CTA ACC AAA GAA AAA GGG 


GGT TGC 


TTA 


OCT 


CAG CAG 


GGA GGG AAA TA AAG 



Leu Thr Lys Glu Lys Gly 

Lys Arg Gly Leu Leu Thr Ser Ala Gly Arg Glu lie Lys> 

3560 3570 3580 3590 3600 

* * * * * * * * * * 

AAC AAA GAG GAA ATT CTA AGC CTA TTA GAA GCC TTA CAT TTG CCA AAA 
Asn Lys Glu Glu lie Leu Ser Leu Leu Glu Ala Leu His Leu Pro Lys> 

3610 3620 3630 3640 3650 

********** 

AGG CTA GCT ATT ATA CAC TCT OCT GGA CAT CAG AAA GCC AAA GAT CTC 
Arg Leu Ala He He His Cys Pro Gly His Gin Lys Ala Lys Asp Leu> 
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3660 3670 3680 3690 ^ (ggQ j D N Q; 2) cont'd 





★ 




★ 


* 


* 




★ 


* * 






ATA 
He 


TCT 
Ser 


AGA GOG AAC CAG 
Arg Gly Asn Gin 


ATG 
Met 


GCT 
Ala 


GAC 
Asp 


CGG GTT GCC 
Arg Val Ala 


AAG CAG GCA GCC 
Lys Gin Ala Ala> 


3700 
★ 




3710 
★ * 


★ 


3720 
* 


* 


3730 
★ 


3740 
* * * 


CAG 
Gin 


GCT 
Ala 


GTT 
Val 


AAC 
Asn 


CTT CTG 
Leu Leu 


CCT 
Pro 


ATA 
He 


ATA 
He 


GAA ACG CCC 
Glu Thr Pro 


AAA 
Lys 


GCC CCA GAA 

Ala Drrt fll 1 i"s 
/iXa rXO VJrJ.UU > 


3750* 


* 


3760 
* 


3770 
* * 




3780 
* * 


★ 


3790 
* 


CCC AGA 
Pro Arg 


OGA 
Arg 


CAG 
Gin 


TAC ACC 
Tyr Thr 


CTA 
Leu 


GAA 
Glu 


GAC 
Asp 


TGG CAA GAG 
Trp Gin Glu 


ATA 
He 


AAA AAG ATA 
Lys Lys Ile> 


3800 
* * 




* 


3810 
* 


* 


3820 
* 


3830 
* * 




3840 
* * 


GAC 
Asp 


CAG 
Gin 


TIC 
Phe 


TCT 
Ser 


GAG ACT 
Glu Thr 


CCG 
Pro 


GAG 
Glu 


GGG 
Gly 


ACC TGC TAT 
Thr Cys Tyr 


ACC 
Thr 


TCA TAT GGG 
Ser Tyr Gly> 


★ 


3850 
★ 




3860 
* * 




★ 


3870 3880 
* * * 


3890 
* ★ 


AAG 
Lys 


GAA 
Glu 


ATC 
He 


CTG 
Leu 


CCC CAC 
Pro His 


AAA 
Lys 


GAA 
Glu 


GGG TTA GAA TAT 
Gly Leu Glu Tyr 


GTC 
Val 


CAA CAG ATA 
Gin Gin Ile> 




* 


3900 
* 


3910 
* ★ 




3920 
+ + ★ 


3930 

*■ * 


CAT 
His 


CGT 
Arg 


CTA ACC 
Leu Thr 


CAC CTA 
His Leu 


GGA 
Gly 


ACT 
Thr 


AAA CAC CTG CAG 
Lys His Leu Gin 


CAG TTC GTC AGA 
Gin Leu Val Arg> 


3940 




3950 


★ 


3960 
* 


★ 


3970 
* 


3980 
★ * * 


★ 

ACA 
Thr 


TCC 
Ser 


CCT 
Pro 


TAT 
Tyr 


CAT GTT 
His Val 


CTG AGG 
Leu Arg 


CTA CCA GGA GTC 
Leu Pro Gly Val 


GCT 
Ala 


GAC TOG GIG 
Asp Ser Val> 


3990 
★ 


* 


4000 
* 


4010 
* * 




4020 
* * 


★ 


4030 
* 


GTC AAA 
Val Lys 


CAT 
His 


TGT 
Cys 


GIG CCC 
Val Pro 


TGC 
Cys 


CAG 
Gin 


CTG GTT AAT GCT 
LejhVal Asn Ala 


AAT 
Asn 


CCT TCC AGA 
Pro Ser Arg> 


4040 
* * 






4050 
* 


* 


4060 
* 


4070 
* * 




4080 
★ * 


ATA 
He 


CCT 
Pro 


CCA 
Pro 


GGA 
Gly 


AAG AGA 
Lys Arg 


CTA 
Leu 


AGG 
Arg 


GGA AGC CAC CCA 
Gly Ser His Pro 


GGC 
Gly 


GCT CAC TGG 
Ala His Trp> 


* 


4090 
* 




4100 
* * 




* 


4110 4120 
* * * 


4130 
* * 



GAA GTC GAC TTC ACT GAG GTA AAG COG GCT AAA TAC GGA AAC AAA TAT 
Glu Val Asp Phe Ihr Glu Val Lys Pro Ala Lys Tyr Gly Asn Lys Tyr> 

4140 4150 4160 4170 

* * * * * * * * * 

CTA TTC GTT TTT GTA GAC ACC TTT TCA GGA TOG GTA GAG GCT TAT CCT 
Leu Leu Val Phe Val Asp Thr Phe Ser Gly Trp Val Glu Ala Tyr Pro> 

4180 4190 4200 4210 4220 

********* * 

ACT AAA AAA GAG ACT TCA ACC GTC GTC GCT AAG AAA ATA CTC GAG GAA 
Thr Lys Lys Glu Thr Ser Thr Val Val Ala Lys Lys He Leu Glu Glu> 
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4230 4240 4250 4260 4270 (SEQ ID NO: 2) cont'd 

* * * ** * • * * * 

ATT TTT CCA AGA TTT GGA ATA CCT AAG GTA ATA GGG TCA GAC AAT GOT 
He Phe Pro Arg Phe Gly He Pro Lys Val He Gly Ser Asp Asn Gly> 

4280 4290 4300 4310 4320 

********** 
CCA GCT TTC GTT GCC CAG GTA AGT CAG GGA CTG GCC AAG ATA TTG GGG 
Pro Ala Phe Val Ala Gin Val Ser Gin Gly Leu Ala Lys He Leu Gly> 



4330 4340 4350 4360 4370 4380 

* * ** ** ** ** ** 

ATT GAT TG A AAA CTG CAT TGT GCA TAC AGA CCC CAA AGC TCA GGA CAG 
He Asp Lys Leu His Cys Ala Tyr Arg Pro Gin Ser Ser Gly Gln> 



* 


4380 
* 


* 


4390 
* 




4400 
* * 




* 


4410 

* * 


GTA GAG 
Val Glu 


AGG ATG AAT 
Arg Met Asn 


AGA 
Arg 


ACC ATT 
Thr lie 


AAA 
Lys 


GAG ACC CTT 
Glu Thr Leu 


ACC AAA TTC ACC 
Thr Lys Leu Thr> 


4420 
* 


4430 
* * 




* 


4440 
* 


* 


4450 
* 




4460 
* * * 


ACA GAG 
Thr Glu 


ACT GGC ATT 
Thr Gly He 


AAT 
Asn 


GAT TOG 
Asp Trp 


ATG 
Met 


GCT 
Ala 


CTC 
Leu 


CTC 
Leu 


CCC 
Pro 


TTT GTC CTT 
Phe Val Leu> 


4470 


4480 




4490 

i * 




*■ 


4500 
* 


* 


4510 
* 


TTT AGG 
Phe Arg 


GTC AGG 
Val Arg 


AAC 
Asn 


ACC 
Thr 


CCT 
Pro 


GGA 
Gly 


CAG 
Gin 


TTT 
Phe 


GGG CTC 
Gly Leu 


ACC 
Thr 


CCC TAT AAA 
Pro Tyr Lys> 


4520 
* * 


* 


4530 
* 


* 


4540 
* 




4550 
* * 




4560 
* * 


TTG CTC 
Leu Leu 


TAC GGG 
Tyr Gly 


GGA CCC 
Gly Pro 


CCC 
Pro 


COG 
Pro 


TTC 
Leu 


GCA 
Ala 


GAA 
Glu 


ATT 
He 


GCC 
Ala 


TTT GCA CAT 
Phe Ala His> 


4570 
* * 


4580 
* * 




* 


4590 
* 


* 


4600 
* 


4610 
* * 


AGT GCT 
Ser Ala 


GAT GTC 
Asp Val 


CTG 
Leu 


CTT 
Leu 


TCC 
Ser 


CAG 
Gin 


CCT TTC 
Pro Leu 


TTC 
Phe 


TCT AGG 
Ser Arg 


CTC AAG GOG 
Leu Lys Ala> 


* 


4620 
* 


* 


4630 
* 




4640 
* * 




* 


4650 

* * 


CTC GAG 
Leu Glu 


TOG GTC 
Trp Val 


AGG 
Arg 


CAG 
Gin 


CGA 
Arg 


GOG 
Ala 


TOG 
Trp 


AAG 
Lys 


CAG 
Gin 


CTC 
Leu 


OQG GAG GCC TAC 
Arg Glu Ala Tyr> 


4660 
* 


4670 
* * 




* 


4680 
* 


* 


4690 
* 




4700 
* * * 


TCA GGA 
Ser Gly 


GGA GAC 
Gly Asp 


TTG 
Leu 


CAA 
Gin 


GTT CCA 
Val Pro 


CAT 
His 


OGC 
Arg 


TTC 
Phe 


CAA 
Gin 


GTT 
Val 


GGA GAT TCA 
Gly Asp Ser> 


4710 
* 


4720 
* ★ 




4730 
* * 




* 


4740 
* 


* 


4750 
* 


GTC TAT 

Val Tyr 


GTT AGA OGC 
Val Arg Arg 


CAC 
His 


CGT GCA 
Arg Ala 


GGA 
Gly 


AAC 
Asn 


CTC GAG 
Leu Glu 


ACT 
Thr 


COG TAG AAG 

Arg *** Lys> 


4760 
* * 


* 


4770 
★ 


* 


4780 
* 




* 

* 


4790 
* 




4800 
* * 


GGA CCT 
Gly Pro 


TAT CTC GTA CTT 
Tyr Leu Val Leu 


TTC ACC ACA 
Leu Thr Thr 


CCA 
Pro 


ACG GCT 
Thr Ala 


GTC 
Val 


AAA GTC GAA 
Lys Val Glu> 
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4810 




4820 


4830 




4840 




4850 


it 


★ 






• * 


* * 


★ 




* 




* <* 




CCC 


TTA 


AGC 


TTC GCC TCC ATC GCG 


TOG 


TTC 


CTT ACT 


CTG TCA 


Gly lie 


Pro 


Leu 


Ser 


Phe Ala Ser He Ala 


Trp 


Phe 


Leu Thr 


Leu Ser> 




4860 




4870 4880 






4890 








★ 


★ 




* * * 




★ 








AT'A APT 


OCT CAA 


GTT 


AAT 


GGT AAA -CGC CTT 


GIG 


GAC 


AGC 


COG 


AAC TCC 


lie Thr 


Pro Gin 


Val 


Asn 


Gly Lys Arg Leu 


Val 


Asp 


Ser 


Pro 


Asn Ser> 


y\j\j 


4910 






4920 4930 




4940 




it 


* 






* 


* * 


★ 




* 


★ 


* 


PAT AAA 


CCC 


TTA 


TCT 


CTC 


ACC TOG TTA CTT 


ACT 


GAC 


TCC 


GGT 


ACA GGT 


His Lys 


Pro 


Leu 


Ser 


Leu 


Thr Trp Leu Leu 


Thr 


Asp 


Ser 


Gly 


Thr Gly> 


4950 




4960 




4970 


4980 




4990 


* 


* 




* 




* ★ * 




* 


* 




* 


ATT AAT 


ATT 


AAC 


AGC 


ACT 


CAA GOG GAG GCT 


CCC 


TTC 


GGG 


ACC 


TOG TOG 


He Asn 


He 


Asn 


Ser 


Thr 


Gin Gly Glu Ala 


Pro 


Leu 


Gly 


Thr 


Trp Trp> 


5000 






5010 


5020 


5030 






5040 


* ★ 




+ 




★ 




★ 


★ 




★ 


★ 


CCT GAA 


TTA 


TAT 


GTC 


TGC 


CTT OGA TCA GTA 


ATC 


CCT 


GGT 


CTC 


AAT GAC 


Pro Glu 


Leu 


Tyr 


Val Cys 


Leu Arg Ser Val 


He 


Pro 


Gly 


Leu 


Asn Asp> 


5050 




5060 


5070 




5080 




5090 


* 


+ 




* 


* 


* * 


* 




* 




★ ★ 


CAG GCC 


ACA 


CCC 


CCC 


GAT 


GTA CTC CGT GCT 


TAC 


GGG 


TTT 


TAC 


GTT TGC 


Gin Ala 


Thr 


Pro 


Pro 


Asp 


Val Leu Arg Ala 


Tyr 


Gly 


Phe 


Tyr 


Val Cys> 




5100 




5110 5120 






5130 




★ 




* 


+ 




★ ★ ★ 




* 




★ 


* 


OCA GGA 


CCC CCA 


AAT 


AAT 


GAA GAA TAT TCT 


GGA 


AAT 


OCT 


CAG 


GAT TTC 


Pro Gly 


Pro 


Pro 


Asn 


Asn 


Glu Glu Tyr Cys 


Gly 


Asn 


Pro Gin 


Asp Phe> 


140 


5150 






5160 5170 




5180 




* 




★ 




★ 


* * 


* 




* 


* 


* 


TTT TGC 


AAG 


CAA 


TOG 


AGC 


TGC ATA ACT. TCT 


AAT 


GAT 


GGG 


AAT 


TOG AAA 


Phe Cys 


Lys 


Gin 


Trp 


Ser 


Cys He Thr Ser 


Asn 


Asp 


Gly 


Asn 


Trp Lys> 


5190 




5200 




5210 


5220 




5230 


★ 


★ 




* 




* * * 




* 


* 




★ 


TOG CCA 


GTC 


TCT 


CAG 


CAA 


GAC AGA GTA AGT 


TAC 


TCT 


TTT 


GTT 


AAC AAT 


Trp Pro 


Val 


Ser 


Gin 


Gin 


Asp Arg Val Ser 


Tyr Ser 


Phe 


Val 


Asn Asn> 


5240 






5250 


5260 


5270 






5280 


* * 








★ 


* * 


★ 


★ 




* 


★ 


CCT ACC 


AGT 


TAT 


AAT 


CAA 


TTT AAT TAT GGC 


CAT 


GGG 


AGA 


TOG 


AAA GAT 


Pro Thr 


Ser 


Tyr 


Asn Gin 


Phe Asn Tyr Gly 


His 


Gly 


Arg 


Trp 


Lys Asp> 


5290 




5300 


5310 




5320 




5330 


★ 


★ 




* 




* ★ 


* 




★ 




* * 


TOG CAA CAG 


COG 


GTA CAA 


AAA GAT GTA CGA 


AAT 


AAG CAA 


ATA 


AGC TCT 


Trp Gin Gin 


Arg 


Val 


Gin 


Lys Asp Val Arg 


Asn 


Lys Gin 


He 


Ser Cys> 




5340 




5350 5360 






5370 




* 




* 


* 




* * * 




* 




* 


★ 


CAT TOG TTA GAC 


CTA GAT TAC TTA AAA ATA 


AGT 


TTC ACT GAA 


AAA GGA 


His Ser Leu Asp 


Leu 


Asp 


Tyr Leu Lys He 


Ser 


Phe Thr Glu 


Lys Gly> 



(SEQ ID NO: 2) cont'd 
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(SEQ ID NO: 2) cont'd 

5380 5390 5400 5410 5420 v x ' 

* ★ ★ * * * ★ * * ★ 

AAA CAA GAA AAT ATT CAA AAG TGG GTA AAT GGT ATA TCT TOG GGA ATA 
Lys Gin Glu Asn He Gin Lys Trp Val Asn Gly He Ser Trp Gly Ile> 

5430 5440 5450 5460 5470 

* * * * * * * * * 

GTC TAC TAT GGA GGC TCT GOG AGA AAG AAA GGA TCT GIT CIG ACT ATT 
Val Tyr Tyr Gly Gly Ser Gly Arg Lys Lys Gly Ser Val Leu Thr Ile> 

5480 5490 5500 5510 5520 

* * * ** * * * * * 

CGC CTC AGA ATA GAA ACT CAG ATG GAA CCT OCG GTT GCT ATA GGA OCA 
Arg Leu Arg He Glu Thr Gin Met Glu Pro Pro Val Ala He Gly Pro 

5530 5540 5550 5560 

★ * * ***** 

AAT AAG GGT ITS GCC GAA CAA GGA CCT CCA ATC CAA GAA CAG 
Asn Lys Gly Leu Ala Glu Gin Gly Pro Pro He Gin Glu Gln> 



5570 
* * 




* 


5580 
* 


★ 


5590 
* 


5600 
* * 




* 


5610 
* 


AGG CCA TCT CCT 
Arg Pro Ser Pro 


AAC CCC TCT 
Asn Pro Ser 


GAT 
Asp 


TAC AAT 
Tyr Asn 


ACA 
Thr 


AOC TCT GGA 
Thr Ser Gly 


TCA GTC 
Ser Val> 


5620 
* * 




5630 
* * 




* 


5640 
* 


* 


5650 
* 




5660 


CCC ACT 
Pro Thr 


GAG CCT 
Glu Pro 


AAC 
Asn 


ATC ACT 
He Thr 


ATT 
He 


AAA ACA 
Lys Thr 


GGG 
Gly 


GOG 
Ala 


AAA CTT 
Lys Leu 


TTT AGC 
Phe Ser> 


* 


5670 
* 


* 


5680 
* 




5690 
* * 




* 


5700 
* 


★ 


CTC ATC 
Leu He 


CAG GGA 
Gin Gly 


GCT 
Ala 


TTT 
Phe 


CAA 
Gin 


GCT 
Ala 


CTT AAC 
Leu Asn 


TCC 
Ser 


AOG 
Thr 


ACT CCA 
Thr Pro 


GAG GCT 
Glu Ala> 


710 
* 


5720 
* * 




* 


5730 
* 


5740 
* * 




5750 
* * 


* 


ACC TCT 
Thr Ser 


TCT 
Ser 


TGT 
Cys 


TOG 
Trp 


CTT 
Leu 


TCC TTA 
Cys Leu 


GCT TOG 
Ala Ser 


GGC 
Gly 


CCA 
Pro 


CCT 
Pro 


TAC 
Tyr 


TAT GAG 
Tyr Glu> 


5760 
* 


* 


5770 
* 




5780 
* * 


* 


5790 
* 


* 


5800 
* 


GGA ATG 
Gly Met 


GCT 
Ala 


AGA 
Arg 


GGA 
Gly 


GOG 
Gly 


AAA 
Lys 


TTC 
Phe 


AAT GTC 
Asn Val 


ACA AAG 
Thr Lys 


GAA 
Glu 


CAT 
His 


AGA GAC 
Arg Asp> 


5810 
* * 




★ 


5820 
* 


* 


5830 
* 


5840 
* * 




* 


5850 

*. 


CAA TGT 
Gin Cys 


ACA 
Thr 


TOG 
Trp 


GGA TCC 
Gly Ser 


CAA 
Gin 


AAT 
Asn 


AAG CTT 
Lys Leu 


ACC 
Thr 


CTT 
Leu 


ACT 
Thr 


GAG 
Glu 


GTT TCT 
Val Ser> 


5860 
* * 




5870 
* * 




* 


5880 
* 


* 


5890 
* 




5900 
* * 


GGA AAA GGC 
Gly Lys Gly 


AOC 
Thr 


TCC ATA 
Cys He 


GGG 
Gly 


ATC 
Met 


GTT CCC 
Val Pro 


OCA 
Pro 


TCC 
Ser 


CAC 
His 


CAA 
Gin 


CAC CTT 
His Leu> 


* 


5910 
* 


* 


5920 
* 




5930 
* * 




★ 


5940 
* 


* 


TGT AAC CAC ACT 
Cys Asn His Thr 


GAA GOC TTT 
Glu Ala Phe 


AAT 
Asn 


OGA ACC 
Arg Thr 


TCT 
Ser 


GAG AGT CAA 
Glu Ser Gin 


TAT CTC 
Tyr Leu> 
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5950 5960 5970 5980 



5 99 o (SEQ ID NO: 2) cont'd 



GTA CCT GGT TAT GAC AGG TOG TOG GCA TGT AAT ACT GGA TTA ACC OCT 
Val Pro Gly Tyr Asp Arg Trp Trp Ala Cys Asn Thr Gly Leu Thr Pro 



6000 
* 


★ 


6010 
★ 


* 


5020 
* 


★ 


6030 
★ 


6040 
* ★ 


TGT GTT 
Cys Val 


TCC 
Ser 


ACC 
Thr 


TIG GTT 
Leu Val 


TTC 
Phe 


AAC CAA ACT 
Asn Gin Thr 


AAA GAC TTT TCC 
Lys Asp Phe Cys 


GTT ATC 
Val Met> 


6050 
* * 




★ 


6060 
* 


* 


6070 
* 


6080 
* * 


* 


6090 
* 


GTC CAA 
Val Gin 


ATT 
lie 


GTC 
Val 


CCC COG 
Pro Arg 


GTC 
Val 


TAC 
Tyr 


TAC TAT 
Tyr Tyr 


CCC GAA AAA GCA 
Pro Glu Lys Ala 


GTC CTT 
Val Leu> 


6100 
* * 




6110 
★ * 




* 


6120 
* 


6130 
* * 


6140 
★ * 


GAT GAA 
Asp Glu 


TAT 
TVr 


GAC 
Asp 


TAT AGA 
Tyr Arg 


TAT 
Tyr 


AAT 
Asn 


COG CCA 
Arg Pro 


AAA AGA 
Lys Arg 


GAG CCC 
Glu Pro 


ATA TCC 
He Ser> 


* 


6150 
* 


6160 
* ★ 




6170 
* * 


★ 


6180 
★ 


★ 


CTG ACA 
Leu Thr 


CTA GOT 
Leu Ala 


GTA ATG 
Val Met 


CTC 
Leu 


GGA 
Gly 


TTC GGA 
Leu Gly 


GTC GCT 
Val Ala 


GCA GGC 
Ala Gly 


GTC GGA 
Val Gly> 


6190 
* 


6200 
* * 


* 


6210 


6220 
* * 


6230 
* * 


★ 


ACA GGA 
Thr Gly 


ACG 
Thr 


GCT 
Ala 


GCC CTA 
Ala Leu 


ATC ACA 
He Thr 


GGA CCG 
Gly Pro 


CAA CAG 
Gin Gin 


CTC GAG 
Leu Glu 


AAA GGA 
Lys Gly> 


6240 
* 


★ 


6250 
* 


6260 
★ * 


★ 


6270 
* 


6280 
* * 


CTT AGT 
Leu Ser 


AAC 
Asn 


CTA 
Leu 


CAT OGA 
His Arg 


ATT 
He 


GTA 
Val 


AOG GAA 
Thr Glu 


GAT CTC 
Asp Leu 


CAA GCC 
Gin Ala 


CTA GAA 
Leu Glu> 


6290 
* * 




★ 


6300 
* 


* 


6310 
★ 


6320 
* * 


* 


6330 
* 


AAA TCT 
Lys Ser 


GTC 
Val 


AGT 
Ser 


AAC CTG 
Asn Leu 


GAG 
Glu 


GAA 
Glu 


TCC CTA 
Ser Leu 


ACC TCC 
Thr Ser 


TTA TCT 
Leu Ser 


GAA GTC 
Glu Val> 


6340 
* * 




6350 
* * 




★ 


6360 
* 


6370 
* * 


6380 
* * 


GTT CTA 
Val Leu 


CAG 
Gin 


AAC 
Asn 


AGA AGG 
Arg Arg 


GOG 
Gly 


TTA 
Leu 


GAT CTG 
Asp Leu 


TTA TTT 
Leu Phe 


CTA AAA 
Leu Lys 


GAA GGA 
Glu Gly> 


* 


6390 
* 


6400 
* * 




6410 
* * 


★ 


6420 
* 


* 


GGG TTA 
Gly Leu 


TGT GTA 
Cys Val 


GCC TTA 
Ala Leu 


AAA 
Lys 


GAG 
Glu 


GAA TCC 
Glu Cys 


TCC TTC 
Cys Phe 


TAT GTA 
Tyr Val 


GAT CAC 
Asp His> 


6430 


6440 
* * 


★ 


6450 
★ 


6460 
* * 


6470 
* ★ 


* 


TCA GGA 
Ser Gly 


GCC ATC 
Ala He 


AGA GAC 
Arg Asp 


TCC ATG 
Ser Met 


AGC AAG CTT AGA 
Ser Lys Leu Arg 


GAA AGG 
Glu Arg 


TTA GAG 
Leu Glu> 



6480 6490 6500 6510 6520 

* * ****** * 

AGG OGT OGA AGG GAA AGA GAG GCT GAC CAG GGG TOG TTT GAA GGA TOG 
Arg Arg Arg Arg Glu Arg Glu Ala Asp Gin Gly Trp Phe Glu Gly Trp> 
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6530 6540 6550 6560 6570 (SEQ ID NO: 2) cont'd 

********** 
TTC AAC AGG TCT OCT TOG ATG ACC ACC CTG CTT TCT GCT CTC AOG GGG 
Phe Asn Arg Ser Pro Trp Met Thr Thr Leu Leu Ser Ala Leu Thr Gly> 

6580 6590 6600 6610 6620 

********** 
CCC CTA GTA GTC CTC CTC CTG TTA CTT ACA GTT GGG OCT TGC TTA ATT 
Pro Leu Val Val Leu Leu Leu Leu Leu Thr Val Gly Pro Cys Leu Ile> 

6630 6640 6650 6660 

*** **** ★ * 

AAT AGG TTT GTT GCC TTT GTT AGA GAA CGA GTC AGT GCA GTC CAG ATC 
Asn Arg Phe Val Ala Phe Val Arg Glu Arg Val Ser Ala Val Gin Ile> 

6670 6680 6690 6700 6710 

********** 

ATG GTA CTT AGG CAA CAG TAC CAA GGC CTT CTC AGC CAA GGA GAA ACT 
Met Val Leu Arg Gin Gin Tyr Gin Gly Leu Leu Ser Gin Gly Glu Thr> 

6720 6730 6740 6750 6760 6770 

* ** ** ** ** ** 

GAC CTC TAGCCTTC CCAGTTCTAA GATTAGAACT ATTAACAAGA CAAGAAGTGG 
Asp Leu> 

6780 6790 6800 6810 6820 6830 

** * ★ ** ** ** ** 

GGAATGAAAG GATGAAAATG CAACCTAACC CTCCCAGAAC CCAGGAAGTT AATAAAAAGC 

6840 6850 6860 6870 6880 6890 

** *★ .★* ** ** ** 

TCTAAATGCC CCCGAATTCC AGACCCTCCT GGCTGCCAGT AAATAQGTAG AAGGTCACAC 

6900 6910 6920 6930 6940 6950 

** ** ** ** ** ** 

TTCCTATTGT TCCAGGGCCT GCTATCCTGG CCTAAGTAAG ATAACAGGAA ATGAGTTGAC 

6960 6970 6980 6990 7000 7010 

************ 
TAATCGCTTA TCTGGATTCT GTAAAACTGA CTG3CACCAT AGAAGAATTG ATTACACATT 

7020 7030 7040 7050 7060 7070 

* * ** ** ** ** ** 

GACAGCCCTA GTGACCTATC TCAACTOCAA TCTCTCACTC TGCCCAGGAG CCCACGCAGA 

7080 7090 7100 7110 7120 7130 

** ** ** ** ** ** 

TGCGGACCTC OQGAGCTATT TTAAAATGAT TGGTCCAOGG AGOGOGGGCT CTOGATATTT 

7140 7150 7160 7170 7180 7190 

* * * * ** ** ** ** 

TAAAATGATT GGTCCATGGA GOGCGGGCTC TOGATATTTT AAAATCATTG GTTTGTGAOG 

7200 7210 7220 7230 7240 7250 

** ** ** ** ** ** 

CACAGGCTTT GTTGTGAACC CCATAAAAGC TGTCCOGATT COGCACTOGG GGCCGCAGTC 
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7260 7270 7280 7290 7300 7310 (SEQ ID NO: 2) cont'd 

CTCTACCCCT GOCTQGTGTA CGACTCTQQG CCCCAGCGCG CTTGGAATAA AAATCCTCTT 



7320 7330 
* ★ * * 

GCTGTTTGCA TCAAAAAAAA AAA 
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io 20 30 40 so 60 (SEQ ID NO: 3) 

. * * * * * * * * * * , * * 

GCGTGGTCTA CGACTGTOGG CCCCAGCGQG CTTGGAATAA AAATCCTCTT GCTGTTTGCA 



70 80 90 100 110 120 

* * * * * * * * * • * * * 

TCAAGACCGC TTCTCGTGAG TGATTAAGGG GAGTCGCCTT TTCCGAGCCT GGAGGTTCTT 



130 140 150 160 170 180 

* * * * * * * * * * * * 

TTTGCTGGTC TTACATTTOG GGGCTCGTOC GGGATCTGTC GOQGCCACCC CTAACACCCG 

190 200 210 220 230 240 

* * * * * * * * * # * * 

AGAACCGACT TCGAGGTAAA AAGGATCCTC TTTTTAACGT GTATGCATGT ACCGGCCGGC 



250 260 270 280 290 300 

★ * * * * * * * * * * . * 

GTCTCTGTTC TGAGTGTCTG TTTTCAGTGG TGOGCGCTTT OGGTTTGCAG CTCTCCTCTC 



310 320 330 340 350 360 

★ * * * *★ * * * * * * 

AGGCCGTAAG GGCTGQGGGA CIGTGATCAG CAGACGTGCT AGGAGGATCA CAGGCTGCTG 



370 380 390 400 410 420 

★ * ** *★ ** ★* ** 

CCCTGGGQGA CGCCCOGGGA GGTGAGGAGA GCCAGGGACG CCTGGTGGTC TCCTACTGTC 



430 440 450 460 470 480 

** ★* ★* ★* *★ 

GGTCAGAGGA CCGAATTCrG TTGCTGAAGC GAAAGCTTCC CCCTCCGCGA CCGTCCGACT 



490 500 510 520 530 540 

** ** *★ ** ** *★ 

CTTTTGCCTC CTTCTGGAAG AOGTGGACGG GTCAOGTCTG TCTCGATCTG TTGGTTTCTG 



550 560 570 580 590 

******** ** * 

TTTTGTGTCT CTTTGTCTTG TGTGTCCTTC TCTACAGTTT TAAT ATG GGA CAG AOG 

Met Gly Gin Thr> 



600 


610 


620 


630 


640 




* * 




* * 


* * 




* * 




* 


GTG 


AOG ACC 


CCT 


CTT AGT TTG 


ACT CTC GAC 


CAT 


TGG ACT 


GAA 


GTT AAA 


Val 


Thr Thr 


Pro 


Leu Ser Leu 


Thr Leu Asp 


His 


Trp Thr 


Glu 


Val Lys> 




650 




660 


670 




680 




690 


* 


* 


* 


* * 


* 


* 


* 


* 


* 


TCC 


AGG OCT 


CAT 


AAT TTG TCA 


GTT CAG GTT 


AAG 


AAG GGA 


CCT 


TGG CAG 


Ser 


Arg Ala 


His 


Asn Leu Ser 


Val Gin Val 


Lys 


Lys Gly 


Pro 


Trp Gln> 




700 




710 


720 




730 




740 


* 


* 




* * 


* * 


* 


* 




* * 


ACT 


TTC TGT 


GTC 


TCT GAA TGG 


COG ACA TTC 


GAT 


GTT GGA 


TOG 


CCA TCA 


Thr 


Phe Cys 


Val 


Ser Glu Trp 


Pro Thr Phe 


Asp 


Val Gly 


Trp 


Pro Ser> 
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750 760 770 780 (SEQ ID NO: 3) cont'd 



* * * * 




* * 




★ 




* 


* 




GAG GGG ACC TTT AAT TCT GAG ATT 
Glu Gly Thr Phe Asn Ser Glu lie 


ATC CTG GCT GTT 
He Leu Ala Val 


AAA 
Lys 


GCA GTT 
Ala Val 


ATT 
Ile> 


790 800 810 
***** 


* 


820 
* 




* 


830 
* 




* 


TTT CAG ACT GGA OCC GGC TCT 
Phe Gin Thr Gly Pro Gly Ser 


CAT 
His 


CCC GAT 
Pro Asp 


CAG GAG 
Gin Glu 


CCC 
Pro 


TAT ATC 
Tyr He 


CTT 
Leu> 


840 850 

* * * * 


860 
* 


* 


870 
* 


* 


880 
* 




AOG TOG CAA GAT TTG GCA GAG 
Thr Trp Gin Asp Leu Ala Glu 


GAT 
Asp 


CCT CCG 
Pro Pro 


OCA 
Pro 


TGG 
Trp 


GTT 
Val 


AAA 

Lys 


CCA 
Pro 


TOG 
Trp> 


890 900 
* * * * ■ * 


910 
* 


* 


920 
* 




* 


930 
* 


CIG AAT AAG CCA AGA AAG CCA 
Leu Asn Lys Pro Arg Lys Pro 


GGT 
Gly 


CCC CGA 
Pro Arg 


ATT 
He 


CTG 
Leu 


GCT 
Ala 


CTT 
Leu 


GGA 
Gly 


GAG 
Glu> 


940 950 
* * * * 


* 


960 
* 


★ 


970 
* 




★ 


980 
* 


AAA AAC AAA CAC TCG GCT GAA 
Lys Asn Lys His Ser Ala Glu 


AAA 
Lys 


GTC AAG 
Val Lys 


OCC 
Pro 


TCT 
Ser 


CCT 
Pro 


CAT 
His 


ATC 
He 


TAC 
Tyr> 


/* 

990 1000 
* * * * 




1 Pin 
* * 




* 


1020 
* 


* 




CCC GAG ATT GAG GAG CCA CCG 
Pro Glu lie Glu Glu Pro Pro 


GCT 
Ala 


TGG CCG 
Trp Pro 


GAA 
Glu 


CCC 
Pro 


CAA TCT 
Gin Ser 


GTT 
Val 


CCC 
Pro> 


1030 1040 1050 
***** 


1060 
* * 




1070 
* * 




* 


CCA CCC OCT TAT CTG GCA CAG GGT 
Pro Pro Pro Tyr Leu Ala Gin Gly 


GCC GCG 
Ala Ala 


AGG 
Arg 


GGA 
Gly 


CCC 
Pro 


TTT 
Phe 


GCC 
Ala 


CCT 
Pro 


1080 1090 1100 
* * * * * 


* 


1110 
* 


* 


1120 
* 




CCT GGA GCT COG GOG GT3 GAG 
Pro Gly Ala Pro Ala Val Glu 


GGA 
Gly 


CCT GCT 
Pro Ala 


GCA GGG 
Ala Gly 


ACT 
Thr 


CGG 
Arg 


AGC 
Ser 


CGG 
Arg> 


1130 1140 
* * * * * 


1150 
* 


1160 
* * 




★ 


1170 
* 


AGG GGC GCC ACC CCG GAG CGG 
Arg Gly Ala Thr Pro Glu Arg 


ACA 
Thr 


GAC GAG 
Asp Glu 


ATC 
He 


GOG 
Ala 


ACA 
Thr 


TTA 
Leu 


COG CTG 
Pro Leu> 


1180 1190 
* * * * 


* 


1200 
* 


* 


1210 
* 




1220 
* * 


OGC AOG TAC GGC OCT CCC ACA 
Arg Thr Tyr Gly Pro Pro Thr 


COG 
Pro 


GGG GGC 
Gly Gly 


CAA 
Gin 


TTG 
Leu 


CAG 
Gin 


CCC 
Pro 


CTC 
Leu 


CAG 
Gln> 


1230 1240 
* * * * 




1250 
* * 




* 


1260 
* 


* 




TAT TGG CCC TTT TCT TCT GCA 
Tyr Trp Pro Phe Ser Ser Ala 


GAT 
Asp 


CTC TAT 
Leu Tyr 


AAT 
Asn 


TGG 
Trp 


AAA ACT 
Lys Thr 


AAC 
Asn 


CAT 
His> 


1270 1280 1290 
***** 


1300 
* * 




1310 
* * 




★ 


CCC CCT TTC TOG GAG GAT CCC CAA 
Pro Pro Phe Ser Glu Asp Pro Gin 


CGC CTC ACG 
Arg Leu Thr 


GGG 
Gly 


TTG 
Leu 


GTG 
Val 


GAG 
Glu 


TCC 
Ser> 
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1320 1330 1340 1350 1360 (SEQ ID NO: 3) cont'd 





★ 


* 




* 






* 


* 


* 


★ 


* 


CTT ATG 


TIC 


TCT 


CAC 


CAG 


CCT 


ACT TGG GAT 


GAT TGT 


CAA CAG 


CTG CTG 


Leu 


Met 


Phe 


Ser 


His Gin 


Pro 


Thr Trp Asp 


Asp Cys Gin Gin 


Leu Leu> 


1370 






1380 




1390 


1400 




1410 


* 


★ 




* 




* 


* 




* 


* * 


* 


★ 


CAG ACA 


CTC 


TTC 


ACA AOC 


GAG 


GAG 


CGA GAG 


AGA ATT 


CTA TTA 


GAG GCT 


Gin Thr 


Leu 


Phe 


Thr Thr 


Glu 


Glu 


Arg Glu 


Arg He Leu Leu 


Glu Ala> 




1420 




1430 






1440 


1450 


1460 






★ 




★ 


★ 




* 


* 


* 


* 


★ * 


AGA AAA 


AAT 


GTT 


OCT 


GGG 


GCC 


GAC 


GGG CGA 


CCC ACG 


CGG TTG 


CAA AAT 


Arg Lys 


Asn 


Val 


Pro 


Gly 


Ala 


Asp 


Gly Arg 


Pro Thr 


Arg Leu 


Gin Asn> 






1470 




1480 




1490 




1500 






★ 




* 


* 




* 




* * 


* 


* 


★ 


GAG ATT 


GAC ATG 


GGA 


TIT 


CCC 


TTA 


ACT OGC 


CCC GGT 


TGG GAC 


TAC AAC 


Glu lie 


Asp Met 


Gly 


Phe 


Pro 


Leu 


Thr Arg 


Pro Gly 


Trp Asp 


Tyr Asn> 


1510 




1520 






1530 


1540 


1550 




* 




★ 


★ 




★ 








* 


★ * 


* 


ACG GCT 


GAA 


GGT 


AGG 


GAG 


AGC 


TTG 


AAA ATC 


TAT CGC 


CAG GCT 


CTG GTG 


Thr Ala 


Glu 


Gly 


Arg 


Glu 


Ser 


Leu 


Lys He 


Tyr Arg 


Gin Ala 


Leu Val> 


1560 




1570 




1580 




1590 


1600 




★ 


* 




★ 






* 


* 




* 


★ - 


GOG GGT 


CTC 


COG 


GGC 


GCC 


TCA 


AGA 


CGG CCC 


ACT AAT 


TTG GCT 


AAG GTA 


Ala Gly 


Leu 


Arg 


Gly 


Ala 


Ser 


Arg 


Arg Pro 


Thr Asn 


Leu Ala 


Lys Val> 


1610 






1620 




1630 


1640 




1650 


* 


* 




* 




★ 


★ 




★ 


★ * 


★ 


* 


AGA 


GAA 


GTG 


ATG 


CAG GGA 


CCG 


AAT 


GAA CCC 


CCC TCT 


GTT TTT 


CTT GAG 


Arg 


Glu 


Val 


Met 


Gin Gly 


Pro 


Asn 


Glu Pro 


Pro Ser 


Val Phe 


Leu Glu> 




1660 




1670 






1680 


1690 


1700 


* 




★ 




* 






* 


* 


* 


* 


★ * 


AGG 


CTC 


TTG 


GAA 


GCC 


TTC 


AGG 


CGG 


TAC ACC 


CCT TTT 


GAT CCC 


ACC TCA 


Arg 


Leu 


Leu 


Glu 


Ala 


Phe 


Arg 


Arg 


Tyr Thr 


Pro Phe 


Asp Pro 


Thr Ser> 






1710 




1720 




1730 




1740 






* 




* 


* 




* 




* * 


* 


* 


* 


GAG 


GQC 


CAA AAA 


GCC 


TCA 


GTG 


GCT 


TTG GCC 


TTT ATA 


GGA CAG 


TCA GCC 


Glu 


Ala 


Gin Lys 


Ala 


Ser 


Val 


Ala 


Leu Ala 


Phe He 


Gly Gin 


Ser Ala> 


1750 




1760 






1770 


1780 


1790 




* 




* 


* 




* 




★ 


* 




★ * 


* 


TTG 


GAT 


ATT 


AGA 


AAG 


AAG 


CTT CAG 


AGA CTG 


GAA GGG 


TTA CAG 


GAG GCT 


Leu 


Asp 


He 


Arg 


Lys 


Lys 


Leu Gin 


Arg Leu 


Glu Gly 


Leu Gin 


Glu Ala> 


1800 




1810 




1820 




1830 


1840 




* 


* 




* 




* 


* 


* 


* 


* 


★ 


GAG TTA 


CGT 


GAT CTA 


GTG 


AAG GAG 


GCA GAG 


AAA GTA 


TAT TAC AAA AGG 


Glu Leu 


Arg 


Asp Leu 


Val 


Lys Glu 


Ala Glu 


Lys Val 


Tyr Tyr Lys Arg> 



1850 1860 1870 1880 1890 

* * * ***** ** 

GAG ACA GAA GAA GAA AGG GAA CAA AGA AAA GAG AGA GAA AGA GAG GAA 
Glu Thr Glu Glu Glu Arg Glu Gin Arg Lys Glu Arg Glu Arg Glu Glu> 
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(SEQ ID NO: 3) cont'd 

1900 1910 1920 1930 1940 





★ 


* * * * 


* 


* 








AGG GAG 
Arg Glu 


GAA AGA 
Glu Arg 


CGT AAT AAA CGG CAA GAG 
Arg Asn Lys Arg Gin Glu 


AAG AAT 
Lys Asn 


TTG ACT 
Leu Thr 


AAG 
Lys 


ATC 
Ile> 


★ 


1950 
* 


1960 1970 
* * * * 


★ 


1980 
* 


★ 




TTG GCT 
Leu Ala 


GCA GIG 
Ala Val 


GTT GAA GGG AAA AGC AAT 
Val Glu Gly Lys Ser Asn 


ACG GAA 
Thr Glu 


AGA GAG 
Arg Glu 


AGA 
Arg 


GAT 
Asp> 


1990 
★ 


2000 
* * 


2010 2020 
* * * * 


2030 
* * 






TTT AGG 
Phe Arg 


AAA ATT 
Lys lie 


AGG TCA GGC OCT AGA CAG 
Arg Ser Gly Pro Arg Gin 


TCA GGG 
Ser Gly 


AAC 
Asn 


CTC 
Leu 


GGC 
Gly 


AAT 
Asn> 


2040 
★ 


2050 2060 
★ * * * * 


2070 
* 


★ 


2080 
* 




AGG ACC 
Arg Thr 


CCA CTC 
Pro Leu 


GAC AAG GAC CAA TGT GCA 
Asp Lys Asp Gin Cys Ala 


TAT TGT 
Tyr Cys 


AAA 
Lys 


GAA AGA 
Glu Arg 


GGA 
Gly> 


2090 
* * 


* 


2100 2110 
* * * 


2120 
★ * 




* 


2130 
* 


CAC TGG 
His Trp 


GCA AGG 
Ala Arg 


AAC TQC CCC AAG AAG GGA 
Asn Cys Pro Lys Lys Gly 


AAC AAA 
Asn Lys 


GGA 
Gly 


CCA AGG ATC 
Pro Arg Ile> 


2140 
* * 


2150 2160 
* * * * 


2170 
* * 




2180 
* * 





CTA GCT CTA GAA GAA GAT AAA GAT TAGG GGAGAOGGGG TTCGGACCCC 
Leu Ala Leu Glu Glu Asp Lys Asp> 

2190 2200 2210 2220 2230 2240 

* ★ * * * * * * * * * * 

CTCCOOGAGC CCAGGGTAAC TTTGAAGGTC GAGGGGCAAC CAGTTCAGTT CCTOGTTGAT 

2250 2260 2270 2280 2290 2300 

* * * * . * * * * * * ** 

ACCGGAGCGA AACATTCAGT GCTACTACAG CCATTAGGAA AACTAAAAGA TAAAAAATCC 

2310 2320 2330 2340 2350 

******* * * * 

TQQGTC ATC GGT GCC ACA GGG CAA CAA CAG TAT CCA TOG ACT ACC CGA AGA 
Met Gly Ala Thr Gly Gin Gin Gin Tyr Pro Trp Thr Thr Arg Arg> 

2360 2370 2380 2390 

* * ★ * * * ★* * 

ACA GTT GAC TTG GGA GTC GGA OQG GTA ACC CAC TOG TTT CTC GTC ATA 
Thr Val Asp Leu Gly Val Gly Arg Val Thr His Ser Phe Leu Val Ile> 

2400 2410 2420 2430 2440 

********** 

CCT GAG TCC CCA GCA CCC CTC TTA GGT AGA GAC TTA TTG ACC AAG ATC 
Pro Glu Cys Pro Ala Pro Leu Leu Gly Arg Asp Leu Leu Thr Lys Met> 

2450 2460 2470 2480 2490 

* ★ ***** *** 

GGA GCA CAA ATT TCT TTT GAA CAA GGG AAA CCA GAA GTC TCT GCA AAT 
Gly Ala Gin He Ser Phe Glu Gin Gly Lys Pro Glu Val Ser Ala Asn> 
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2500 2510 2520 2530 2540 (SEQ ID NO: 3) confc 



★ 




* 








★ 


* 








* 


* 


AAC AAA CCT 
Asn Lys Pro 


He 


ACT GTC 
Thr Val 


TTC ACC CTC CAA 
Leu Thr Leu Gin 


1 Art 

Leu 


GAT GAC 
Asp Asp 


GAA 
Glu 


TAT OGA 
Tyr Arg> 


. 2550 
★ * 


* 


2560 
* 


2570 
★ * 




* 


2580 
★ 


★ 


2590 
★ 


CTA TAC TCT 
Leu Tyr Ser 


CCC 
Pro 


CTA 
Leu 


GTA 
Val 


AAG OCT 
Lys Pro 


GAT CAA 
Asp Gin 


AAT 
Asn 


ATA CAA 
He Gin 


TIC 
Phe 


TOG TTG 
Trp Leu> 




2600 
* 




★ 


2610 

★ * 


2620 
★ 




2630 




* 


GAA CAG 
Glu Gin 


TIT 

Phe 


CCC 
Pro 


CAA 
Gin 


GCC TOG GCA 
Ala Trp Ala 


GAA 
Glu 


ACC 
Thr 


GCA 
Ala 


GGG 
Gly 


ATC 
Met 


GGT 
Gly 


TTC GCA 
Leu Ala> 


2640 

* ★ 


2650 
* 




2660 
* * 


★ 


2670 
* 


* 


2680 


* 


AAG CAA 
Lys Gin 


GTT 
Val 


CCC 
Pro 


CCA 
Pro 


CAA 
Gin 


GTT ATT 
Val He 


CAA 
Gin 


CTC AAG 
Leu Lys 


GCC 
Ala 


AGT 
Ser 


GCC 
Ala. 


ACA OCA 
Thr Pro> 


2690 
* 


* 


2700 
★ 


* 


2710 
* 




2720 
★ ★ 




* 


2730 

* ★ 


GTC TCA 
Val Ser 


GTC 
Val 


AGA CAG 
Arg Gin 


TAC 
Tyr 


CCC TTG 
Pro Leu 


AGT 
Ser 


AAA 
Lys 


GAA 
Glu 


GCT 
Ala 


CAA 
Gin 


GAA GGA ATT 
Glu Gly Ile> 


2740 
* 




2750 
* ★ 




2760 
+ * 


★ 


2770 
★ 




2780 
★ * 


COG CCG 
Arg Pro 


CAT 
His 


GTC 
Val 


CAA 
Gin 


AGA 
Arg 


TTA ATC CAA 
Leu He Gin 


CAG 
Gin 


GGC 
Gly 


ATC 
He 


CTA 
Leu 


GTT 
Val 


CCT GTC 
Pro Val> 


2790 
★ * 


★ 


2800 
* 


2810 




* 


2820 
★ 


★ 


2830 
* 


. CAA TCT CCC 
Gin Ser Pro 


TGG 
Trp 


AAT 
Asn 


ACT 
Thr 


CCC CTC 
Pro Leu 


CTA 
Leu 


CCG 
Pro 


GTT 
Val 


AGA AAG 
Arg Lys 


CCT 
Pro 


GGG ACT 
Gly Thr> 


2840 
* * 




★ 


2850 

* ★ 


2860 
* 




2870 
* * 




* 


AAT GAC 
Asn Asp 


TAT 
Tyr 


CGA 
Arg 


CCA 
Pro 


GTA CAG GAC 
Val Gin Asp 


TTC 
Leu 


AGA 
Arg 


GAG 
Glu 


GTC 
Val 


AAT 
Asn 


AAA 
Lys 


COG GTC 
Arg Val> 


2880 

★ * 


2890 
* 




2900 
* * 


* 


2910 
* 


* 


2920 
* 


* 


CAG GAT 
Gin Asp 


ATA 
He 


CAC 
His 


CCA 
Pro 


ACA 
Thr 


GTC COG 
Val Pro 


AAC 
Asn 


CCT TAT 
Pro Tyr 


AAC 
Asn 


CTC 
Leu 


TTC 
Leu 


TCT GCT 
Cys Ala> 


2930 
* 


★ 


2940 
★ 


* 


2950 
* 




2960 
* * 




* 


2970 

* * 


CTC CCA 
Leu Pro 


CCC 
Pro 


CAA COG 
Gin Arg 


AGC 
Ser 


TOG TAT 
Trp Tyr 


ACA 
Thr 


GTA 
Val 


TTG 
Leu 


GAC 
Asp 


TTA 
Leu 


AAG GAT GCC 
Lys Asp Ala> 


2980 
* 




2990 
* * 




3000 
* * 


★ 


3010 
* 




3020 
* * 


TTC TTC 

Phe Phe 


TGC 

Cys 


CTC AGA 

Leu Arg 


TTA 
Leu 


CAC CCC ACT 
His Pro Thr 


AGC 
Ser 


CAA CCA 

Gin Pro 


CTT 

Leu 


TTT GCC TTC 
Phe Ala Phe> 


3030 
* * 




3040 
* 


3050 
★ * 




* 


3060 
* 




3070 
* 


GAA TCG AGA 
Glu Trp Arg 


GAT CCA GGT 
Asp Pro Gly 


ACG GGA AGA 
Thr Gly Arg 


ACC 
Thr 


GGG CAG CTC 
Gly Gin Leu 


ACC TOG ACC 
Thr Trp Thr> 
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3080 3090 3100 3110 (SEQ ID NO: 3) cont'd 





* 


* 




* 




* 


* 




★ 




* 


★ 






CGA CTG 
Arg Leu 


OCC CAA GGG 
Pro Gin Gly 


TTC AAG AAC 
Phe Lys Asn 


TCC 
Ser 


CCG ACC 
Pro Thr 


ATC 
He 


TTT GAC GAA GCC 
Phe Asp Glu Ala> 


3120 
* 


★ 


3130 
* 




3140 
★ ★ 




* 


3150 
* 


★ 


3160 
* 




CPA CAC 
Leu His 


AGA 
Arg 


GAC CTG 
Asp Leu 


GCC 
Ala 


AAC TTC 
Asn Phe 


AGG 
Arg 


ATC CAA 
He Gin 


CAC 
His 


OCT 
Pro 


CAG GTG ACC 
Gin Val Thr> 


3170 
★ 




* 


3180 


* 


3190 
★ 




3200 
* * 




* 


3210 

* * 


CTC 
Leu 


CTC 
Leu 


CAG 
Gin 


TAC GIG 
Tyr Val 


GAT 
Asp 


GAC 
Asp 


CTG 
Leu 


CTT 
Leu 


CTC 
Leu 


GCG 
Ala 


GGA 
Gly 


GCC 
Ala 


ACC AAA CAG 
Thr Lys Gln> 


3220 
★ 




3230 
★ * 




★ 


3240 


★ 


3250 
★ 




3260 
★ * 


GAC 
Asp 


TGC 
Cys 


TTA 
Leu 


GAA 
Glu 


GGC 
Gly 


AOS 
Thr 


AAG 
Lys 


GCA CTA 
Ala Leu 


CTG 
Leu 


CTC 
Leu 


GAA 
Glu 


TTG 
Leu 


TCT 
Ser 


GAC CTA 
Asp Leu> 


* 


3270 
★ 


* 


3280 
* 




3290 
★ * 




* 


3300 
★ 


* 


3310 
* 


GGC 
Gly 


TAC AGA 
Tyr Arg 


GCC 
Ala 


TCT 
Ser 


GCT 
Ala 


AAG 
Lys 


AAG 
Lys 


GCC 
Ala 


CAG 
Gin 


ATT 
He 


TGC AGG 
Cys Arg 


AGA 
Arg 


GAG GTA 
Glu Val> 




. 3320 
* * 




* 


3330 


* 


3340 
* 




3350 
* * 




★ 


ACA 
Thr 


TAC 
Tyr 


TTG 
Leu 


GGG 
Gly 


TAC 
Tyr 


AGT TTG 
Ser Leu 


CGG 
Arg 


GAC 
Asp 


GGG 
Gly 


CAG 
Gin 


CGA 
Arg 


TGG 
Trp 


CTG 
Leu 


AOG GAG 
Thr Glu> 


3360 
★ 


★ 


3370 
* 




3380 
* * 




★ 


3390 
* 


* 


3400 
* 


★ 


. GCA 
Ala 


CGG 
Arg 


AAG 
Lys 


AAA 
Lys 


ACT 
Thr 


GTA 
Val 


GTC 
Val 


CAG 
Gin 


ATA 
lie 


CCG GCC 
Pro Ala 


CCA 
Pro 


ACC 
Thr 


ACA 
Thr 


GCC AAA 
Ala Lys> 


3410 
* 




* 


3420 
* 


* 


3430 
* 




3440 
* * 




★ 


3450 

* * 


CAA 
Gin 


ATG 
Met 


AGA 
Arg 


GAG 
Glu 


TTT 
Phe 


TIG 
Leu 


GGG 
Gly 


ACA 
Thr 


GCT 
Ala 


GGA 
Gly 


TTT 
Phe 


TGC 
Cys 


AGA 
Arg 


CTG TGG ATC 
Leu Trp Ile> 


3460 
* 




3470 
★ * 




* 


3480 
* 


* 


3490 
+ 




3500 
* * 


COG 
Pro 


GGG 
Gly 


TTT 
Phe 


GOG 
Ala 


ACC 
Thr 


TTA 
Leu 


GCA 
Ala 


GCC CCA 
Ala Pro 


CTC 
Leu 


TAC 
Tyr 


COG 
Pro 


CTA 
Leu 


ACC 
Thr 


AAA GAA 
Lys Glu> 


* 


3510 
★ 


* 


3520 
★ 




3530 
* * 




★ ' 


3540 
* 


★ 


3550 
* 


AAA 
Lys 


GGG GAA 
Gly Glu 


TTC 
Phe 


TCC 
Ser 


TOG 
Trp 


GCT 
Ala 


OCT 
Pro 


GAG 
Glu 


CAC 
His 


CAG 
Gin 


AAG GCA 
Lys Ala 


TTT 
Phe 


GAT GCT 
Asp Ala> 




3560 
* * 




* 


3570 
★ 


* 


3580 
★ 




3590 
★ * 




* 


ATC 
He 


AAA AAG 

Lys Lys 


GCC 
Ala 


CTG 
Leu 


CTG AGC 
Leu Ser 


GCA 
Ala 


CCT GCT 
Pro Ala 


CTG 
Leu 


GCC CTC 
Ala Leu 


CCT 
Pro 


GAC GTA 
Asp Val> 


3600 
* 


* 


3610 
* 




3620 
* ★ 




* 


3630 
* 




3640 
* 


★ 


ACT 
Thr 


AAA OCC TTT 
Lys Pro Phe 


ACC 
Thr 


CTT TAT 
Leu Tyr 


GTC 
Val 


GAT GAG OGT 
Asp Glu Arg 


AAG GGA GTA 
Lys Gly Val 


GCC OQG 
Ala Arg> 
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3650 3660 3670 3680 3690 



(SEQ ID NO: 3) com', 



* 




* 




★ 


* 








★ 


★ 




* 




* 


★ 


GGA GOT TTA 


ACC CAA ACC 


CTA GGA OCA 


TGG 


AGA AGA CCT 


GTC 


GCC 


TAC 


Gly 


Val 


Leu 


Thr Gin Thr 


Leu Gly Pro 


Trp 


Arg Arg Pro 


Val 


Ala 


Tyr> 


3700 






3710 






3720 




3730 




3740 






* 




•*- 


* 








★ 


★ 




* 




★ 


* 




CTG 


TCA 


AAG 


AAG 


CTC 


GAT 


CCT GTA GCC 


AGT 


GGT 


TOG 


CCC 


ATA 


TGC 


CTG 


Leu 


Ser Lys 


Lys 


Leu Asp 


Pro Val Ala 


Ser 


Gly 


Trp Pro 


He 


Cys 


Leu> 




3750 




3760 




3770 






3780 




3790 


* 




★ 


* 




* 




* 


★ 




+ 




* 


* 




* 


AAG 


GCT ATC 


GCA 


GCT 


GTG 


GCC ATA 


CTG 


GTC 


AAG 


GAC 


GCT 


GAC 


AAA 


TTG 


Lys 


Ala 


He 


Ala 


Ala 


Val 


Ala 


He 


Leu 


Val 


Lys 


Asp Ala 


Asp 


Lys 


Leu> 




3800 






3810 




3820 




3830 










★ • 


* 




★ 




★ 


★ 




★ 




* 


* 




★ 




ACT 


TTG 


GGA 


CAG 


AAT 


ATA ACT GTA 


ATA 


GCC 


CCC 


CAT 


GCA 


TTG 


GAG 


AAC 


Thr 


Leu 


Gly 


Gin 


Asn 


He Thr Val 


He 


Ala 


Pro 


His 


Ala 


Leu 


Glu 


Asn> 


3840 




3850 




3860 






3870 




3880 






* 


* 








★ 


* 




* 






★ 




* 




* 


ATC 


GTT 


CGG 


CAG 


CCC 


CCA 


GAC 


OGA 


TGG 


ATG ACC 


AAC 


GCC 


CGC 


ATG 


ACC 


He 


Val 


Arg 


Gin 


Pro 


Pro 


Asp Arg 


Trp 


Met 


Thr 


Asn 


Ala 


Arg 


Met 


Thr> 


3890 






3900 




3910 




3920 






3930 




★ 




★ 




★ 


* 




★ 




★ 


* 




* 




* 


* 


CAC 




CAA 


AGC 


CTG 


CTT 


CTC 


ACA 


GAG 


AGG 


GTC 


AOG 


TTC 


GCT 


CCA 


OCA 


His 


Tyr 


Gin 


Ser Leu 


Leu 


Leu 


Thr 


Glu 


Arg 


Val 


Thr 


Phe 


Ala 


Pro 


Pro> 


3940 




3950 






3960 




3970 




3980 






* 




* 






* 




* 


* 




★ 




* 


* 




GCC 


GCT 


CTC 


AAC 


OCT 


GCC 


ACT 


CTT CTG 


CCT 


GAA 


GAG 


ACT 


GAT 


GAA 


CCA 


Ala 


Ala 


Leu 


Asn 


Pro 


Ala 


Thr 


Leu 


Leu 


Pro 


Glu 


Glu 


Thr 


Asp 


Glu 


Pro 




3990 




4000 




4010 






4020 




4030 


★ 




★ 


* 




★ 




★ 


★ 




★ 




* 


* 




.* 


GTG 


ACT 


CAT 


GAT 


TGC 


CAT 


CAA 


CTA 


TTG 


ATT 


GAG 


GAG 


ACT 


GGG 


GTC 


CGC 


Val 


Thr His 


Asp 


Cys 


His 


Gin 


Leu 


Leu 


He 


Glu 


Glu 


Thr 


Gly 


Val 


Arg> 




4040 






4050 




4060 




4070 










* 


* 




★ 




★ 


* 




★ 




* 


* 




* 




AAG 


GAC 


CTT 


ACA 


GAC 


ATA COG 


CTG 


ACT 


GGA 


GAA 


GTG 


CTA 


ACC 


TGG 


TTC 


Lys 


Asp 


Leu 


Thr 


Asp 


He Pro 


Leu 


Thr 


Gly 


Glu 


Val 


Leu 


Thr 


Trp 


Phe> 


4080 




4090 




4100 






4110 




4120 












★ 




* 


* 




* 




* 


* 




★ 




* 


ACT 


GAC 


GGA 


AGC 


AGC 


TAT 


GTG 


GTG 


GAA 


GGT AAG 


AGG 


ATG 


GCT 


GGG 


GOG 


Thr 


Asp 


Gly 


Ser 


Ser 


Tyr 


Val 


Val 


Glu 


Gly Lys 


Arg 


Met 


Ala 


Gly 


Ala> 


4130 






4140 




4150 




4160 






4170 




★ 




* 




★ 


* 




★ 




★ 


★ 




* 




* 


★ 


GOG 


GTG 


GTG 


GAC 


GGG 


ACC 


CGC ACG 


ATC 


TGG GCC 


AGC 


AGC 


CTG 


COG 


GAA 


Ala 


Val 


Val 


Asp Gly 


Thr 


Arg Thr 


He 


Trp Ala 


Ser 


Ser 


Leu 


Pro 


Glu> 


4180 




4190 






4200 




4210 




4220 






★ 




★ 


* 




* 






* 




* 




★ 


* 




GGA ACT 


TCA 


GCA 


CAA 


AAG 


GCT GAG CTC 


ATG GCC 


CTC 


ACG 


CAA GCT 


TTG 


Gly Thr- 


Ser 


Ala Gin 


Lys 


Ala Glu Leu 


Met Ala Leu 


Thr 


Gin Ala 


Leu> 
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4230 4240 4250 4260 4270 ( SE Q ID N ° : 3) COni'd 



COG CTG GCC GAA GGG AAA TCC ATA AAC ATT TAT ACG GAC AGC AQG TAT 

Arg Leu Ala Glu Gly Lys Ser He Asn He Tyr Thr Asp Ser Arg T/r> 

4280 4290 4300 4310 

****** * * * 

GCC TTT GCG ACT GCA CAC GTA CAT GGG GCC ATC TAT AAA CAA AGG GGG 

Ala Phe Ala Thr Ala His Val His Gly Ala lie Tyr Lys Gin Arg Gly> 



4320 
* 


* 


4330 




4340 
* * 




★ 


4350 
* 


* 


4360 
* 




* 


TTG 
Leu 


CTT 
Leu 


ACC 
Thr 


TCA 
Ser 


GCA 
Ala 


GGG 
Gly 


AGG 
Arg 


GAA 
Glu 


ATA 
He 


AAG AAC 
Lys Asn 


AAA 
Lys 


GAG 
Glu 


GAA 
Glu 


ATT 
He 


CTA 
Leu> 


4370 
* 




* 


4380 
★ 


* 


4390 
* 




4400 
★ * 




* 


4410 
* 


★ 


AGC 
Ser 


CTA 
Leu 


TTA 
Leu 


GAA GCC 
Glu Ala 


GTA 
Val 


CAT 
His 


TTA 
Leu 


CCA 
Pro 


AAA 
Lys 


AGG 
Arg 


CTA 
Leu 


GCT 
Ala 


ATT ATA 
He He 


CAC 
His> 


4420 
* 




4430 
* * 




* 


4440 
* 


* 


4450 
* 




4460 
* * 




TGT 
Cys 


CCT 
Pro 


GGA 
Gly 


CAT 
His 


CAG 
Gin 


AAA 
Lys 


GCT 
Ala 


AAA GAT 
Lys Asp 


CTC 
Leu 


ATA 
He 


TCC 
Ser 


AGA 
Arg 


GGA 
Gly 


AAC 
Asn 


CAG 
Gln> 


* 


4470 
* 


* 


4480 
* 




4490 
* ★ 




* 


4500 
★ 


* 


4510 


ATC 
Met 


GCT GAC 
Ala Asp 


CGG 
Arg 


GTT 
Val 


GCC 
Ala 


AAG 
Lys 


CAG 
Gin 


GCA 
Ala 


GCC 
Ala 


CAG 
Gin 


GGT GTT 
Gly Val 


AAC 
Asn 


CTT 
Leu 


CTG 
Leu> 




4520 




★ 


4530 
★ 


★ 


4540 
* 




4550 
* * 




★ 




. CCT 
Pro 


ATA 
He 


ATA 
He 


GAA 
Glu 


ATG 
Met 


CCC AAA 
Pro Lys 


GCC 
Ala 


CCA 
Pro 


GAA 
Glu 


CCC 
Pro 


AGA 
Arg 


OGA 
Arg 


CAG 
Gin 


TAC 
Tyr 


ACC 
Thr> 


4560 
★ 


* 


4570 




4580 
* * 




* 


4590 
* 


★ 


4600 
★ 




* 


CTA 
Leu 


GAA 
Glu 


GAC 
Asp 


TOG 
Trp 


CAA 
Gin 


GAG 
Glu 


ATA 
He 


AAA 
Lys 


AAG 
Lys 


ATA GAC 
He Asp 


CAG 
GLn 


TTC 
Phe 


TCT 
Ser 


GAG 
Glu 


ACT 
Thr> 


4610 
* 




* 


4620 
* 


* 


4630 
* 




4640 
* * 




* 


4650 
* 


* 


COG 
Pro 


GAA 
Glu 


GGG 
Gly 


ACC TGC 
Thr Cys 


TAT 
Tyr 


ACC 
Thr 


TCA 
Ser 


GAT 
Asp 


GGG 
Gly 


AAG 
Lys 


GAA 
Glu 


ATC 
He 


CTG 
Leu 


CCC 
Pro 


CAC 
His> 


4660 
* 




4670 
* * 




★ 


4680 
* 


* 


4690 
★ 




4700 
* * 




AAA 
Lys 


GAA 
Glu 


GGG 
Gly 


TTA 
Leu 


GAA 
Glu 


TAT 
Tyr 


GTC 
Val 


CAA CAG 
Gin Gin 


ATA 
He 


CAT 
His 


CGT 
Arg 


CTA 
Leu 


ACC 
Thr 


CAC 
His 


CTA 
Leu> 


* 


4710 
* 


* 


4720 
* 




4730 
* * 




* 


4740 
* 


* 


4750 
* 


GGA 
Gly 


ACT AAA 
Thr Lys 


CAC 
His 


CTG CAG 
Leu Gin 


CAG 
Gin 


TTG GTC 
Leu Val 


AGA 
Arg 


ACA 
Thr 


TCC 
Ser 


OCT 
Pro 


TAT 
Tyr 


CAT GTT 
His Val> 




4760 
* * 




* 


4770 
* 


* 


4780 
* 




4790 
* * 




★ 




CTG 
Leu 


AGG CTA 
Arg Leu 


CCA 
Pro 


GGA GTG GCT 
Gly Val Ala 


GAC TOG GTG 
Asp Ser Val 


GTC 
Val 


AAA CAT 
Lys His 


TGT 
Cys 


GTG CCC 
Val Pro> 
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4800 4810 4820 , 4830 4840 (SEQ ID NO: 3) cont'd 



* * 






* 


+ 




★ 








★ 




* 


TGC CAG CTG 
Cys Gin Leu 


GTT AAT 
Val Asn 


GCT 
Ala 


AAT CCT TCC 
Asn Pro Ser 


AGA ATG CCT 
Arg Met Pro 


OCA 
Pro 


GGG AAG 
Gly Lys 


AGA 
Arg> 


4850 

* * 


4860 
* 


★ 


4870 
★ 




4880 
★ * 


* 


4890 
★ 


* 


CTA AGG GGA 
Leu Arg Gly 


AGC CAC 
Ser His 


OCA 
Pro 


GGC 
Gly 


GCT CAC 
Ala His 


TGG 
Trp 


GAA GTG 
Glu Val 


GAC 
Asp 


TTC ACT 
Phe Thr 


GAG 
Glu> 


4900 
* 


4910 
* * 




* 


4920 
* 


★ 


4930 
* 




4940 
* * 




GTA AAG CCG 
Val Lys Pro 


GCT 
Ala 


AAA 
Lys 


TAC 
Tyr 


GGA 
Gly 


AAC AAA 
Asn Lys 


TAC 
Tyr 


CTA TTG 
Leu Leu 


GTT 
Val 


TTT 
Phe 


GTA 
Val 


GAC 
Asp> 


4950 
* * 


★ 


4960 
* 




4970 
* * 




4980 
* * 


* 


4990 
* 


ACC TTT TCA 
Thr Phe Ser 


GGA 
Gly 


TGG 
Trp 


GTA 
Val 


GAG 
Glu 


GCT 
Ala 


TAT 
Tyr 


CCT 
Pro 


ACT AAG AAA 
Thr Lys Lys 


GAG 
Glu 


ACT 
Thr 


TCA 
Ser> 


5000 
* * 




★ 


5010 
★ 


* 


5020 
* 


5030 
* * 




* 




ACC GTG GTG 
Thr Val Val 


GCT 
Ala 


AAA 
Lys 


AAA ATA 
Lys He 


CTG 
Leu 


GAA 
Glu 


GAA 
Glu 


ATT TTT 
He Phe 


CCA 
Pro 


AGA 
Arg 


TTT 
Phe 


GGA 
Gly> 


)40 5050 
★ ★ * 




5060 
* * 




★ 


5070 

* * 


5080 
* 




★ 


ATA CCT AAG 
He Pro Lys 


GTA 
Val 


ATA 
He 


GGG 
Gly 


TCA 
Ser 


GAC 
Asp 


AAT 
Asn 


GGT CCA GCT 
Gly Pro Ala 


TTT 
Phe 


GTT 
Val 


GCC 
Ala 


CAC 
Gln> 


5090 

* ★ 


5100 
* 


* 


5110 




5120 
* * . 


* 


5130 
* 


★ 


GTA AGT CAG 
Val Ser Gin 


GGA CTG 
Gly Leu 


GCC 
Ala 


AAG 
Lys 


ATA 
He 


TTG 
Leu 


GGG 
Gly 


ATT GAT 
He Asp 


TOG 
Trp 


AAA CTG 
Lys Leu 


CAT 
His> 


5140 
★ 


5150 
* * 




★ 


5160 
★ 


* 


5170 
★ 




5180 
* * 




TGT GCA TAC 
Cys Ala Tyr 


AGA 
Arg 


CCC 
Pro 


CAA 
Gin 


AGC 
Ser 


TCA GGA 
Ser Gly 


CAG 
Gin 


GTA GAG 
Val Glu 


AGG 
Arg 


ATG 
Met 


AAT 
Asn 


AGA 
Arg> 


5190 
* * 


★ 


5200 
* 




5210 
* * 




5220 
★ * 


* 


5230 
★ 


ACC ATT AAA 
Thr He Lys 


GAG 
Glu 


ACC 
Thr 


CTT 
Leu 


ACT 
Thr 


AAA 
Lys 


TTG 
Leu 


ACC 
Thr 


GCG GAG ACT 
Ala Glu Thr 


GGC 
Gly 


GTT 
Val 


AAT 
Asn> 



5240 5250 5260 5270 

* * * * * * * * * 

GAT TGG ATA GCT CTC CTG CCC TTT GTG CTT TTT AGG GTT AGG AAC ACC 
Asp Trp He Ala Leu Leu Pro Phe Val Leu Phe Arg Val Arg Asn Thr> 



5280 5290 
★ * * 

CCT GGA CAG TTT 
Pro Gly Gin Phe 



5300 
* * 

GGG CTG ACC CCC 
Gly Leu Thr Pro 



5310 
* * * 

TAT GAA TTA CTC 
Tyr Glu Leu Leu 



5320 

* . * 

TAC GGG GGA CCC 
Tyr Gly Gly Pro 



5330 5340 5350 5360 5370 

* * ******** 

CCC CCA TTC GTA GAA ATT GCT TCT GTA CAT AGT GCT GAC GTC CTC CTT 
Pro Pro Leu Val Glu He Ala Ser Val His Ser Ala Asp Val Leu Leu> 
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5380 5390 5400 5410 5420 (SEQ ID NO: 3) cont'd 

********* 
TCC CAG CCT TTG TTC TCT AGG CTC AAG GCA CTT GAG TCG GTC AGA CAA 
Ser Gin Pro Leu Phe Ser Arg Leu Lys Ala Leu Glu Trp Val Arg Gln> 

5430 5440 5450 5460 5470 

********** 

CGA GCG TGG AGG CAA CTC OGG GAG GCC TAC TCA GGA GGA GGA GAC TTG 
Arg Ala Trp Arg Gin Leu Arg Glu Ala Tyr Ser Gly Gly Gly Asp Leu> 

5480 5490 5500 5510 

********* 

CAG ATC CCA CAT CGT TTC CAA GTG GGA GAT TCA GTC TAC GTT AGA OGC 
Gin He Pro His Arg Phe Gin Val Gly Asp Ser Val Tyr Val Arg Arg> 

5520 5530 5540 5550 5560 

** * ** * * * * ■ * 

CAC CGT GCA GGA AAC CTC GAG ACT COG TGG AAG GGC CCT TAT CTC GTA 
His Arg Ala Gly Asn Leu Glu Thr Arg Trp Lys Gly Pro Tyr Leu Val> 

5570 5580 5590 5600 5610 

* * ** * ** * * ★ 

CTT TTG ACC ACA CCA ACG GCT GTG AAA GTC GAA GGA ATC TCC AOC TGG 
Leu Leu Thr Thr Pro Thr Ala Val Lys Val Glu Gly He Ser Thr Trp> 

5620 5630 5640 5650 5660 

* ** * ** * ** 

ATC CAT GCA TCC CAC GTT AAA CCG GCG CCA CCT CCC GAT TCG GGG TGG 
Met His Pro Thr Leu Asn Arg Arg His Leu Pro He Arg Gly Gly> 

He His Ala Ser His Val Lys Pro Ala Pro Pro Pro Asp Ser Gly Trp> 

5670 5680 5690 5700 5710 

* * * * * * * ** * 

AAA GCC GAA AAG ACT GAA AAT CCC CTT AAG CTT OGC CTC CAT CGC GTG 
Lys Pro Lys Arg Leu Lys He Pro Leu Ser Phe Ala Ser He Ala Trp> 

Lys Ala Glu Lys Thr Glu Asn Pro Leu Lys Leu Arg Leu His Arg Val> 

5720 5730 5740 5750 5760 

** * ** •* * * *★ 

GTT CCT TAC TCT GTC AAT AAC CTC TCA GAC T AAT GGT ATG CGC ATA GGA 
Phe Leu Thr Leu Ser He Thr Ser Gin Thr Asn Gly Met Arg He Gly> 

Val Pro Tyr Ser Val Asn Asn Leu Ser Asp> 

5770 5780 5790 5800 

********* 

GAC AGC CTG AAC TCC CAT AAA CCC TTA TCT CTC AOC TOG TTA ATT ACT 
Asp Ser Leu Asn Ser His Lys Pro Leu Ser Leu Thr Trp Leu He Thr> 

5810 5820 5830 5840 5850 

* ** * ** * ** * 

GAC TCC GGC ACA GGT ATT AAT ATC AAC AAC ACT CAA GGG GAG GCT CCT 
Asp Ser Gly Thr Gly He Asn He Asn Asn Thr Gin Gly Glu Ala Pro> 
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5860 5870 5880 5890 5900 

* * * * * * * ★ + 

TTA GGA ACC TOG TOG CCT GAT CTA TAC GTT TCC CIC AGA TCA GTT ATT 
Leu Gly Thr Trp Trp Pro Asp Leu Tyr Val Cys Leu Ai-g Ser Val Ile> 

5910 5920 5930 5940 5950 

CCT AGT CTC ACC TCA CCC CCA GAT ATC CTC CAT GCT CAC GGA TTT TAT 
Pro Ser Leu Thr Ser Pro Pro Asp lie Leu His Ala His Gly Phe Tyr> 

5960 5970 5980 5990 6000 

********** 

GTT TGC CCA GGA CCA CCA AAT AAT GGA AAA CAT TCC GGA AAT CCC AGA 
Val Cys Pro Gly Pro Pro Asn Asn Gly Lys His Cys Gly Asn Pro Arg> 

6010 6020 6030 6040 

* ****** ** 

GAT TTC TTT TGT AAA CAA TOG AAC TGT GTA ACC TCT AAT GAT GGA TAT 
Asp Phe Phe Cys Lys Gin Trp Asn Cys Val Thr Ser Asn Asp Gly Tyr> 

6050 6060 6070 6080 6090 

* ** **** ** * 

TOG AAA TOG CCA ACC TCT CAG CAG GAT AGG GTA AGT TTT TCT TAT GTC 
Trp Lys Trp Pro Thr Ser Gin Gin Asp Arg Val Ser Phe Ser Tyr Val> 

6100 6110 6120 6130 6140 



(SEQ ID NO: 3) cont'd 



★ * 


* * 


* * 


* 


★ 




* 


AAC ACC TAT ACC 


AGC TCT GGA 


CAA TTT AAT 


TAC CTC 


ACC 


TOG 


ATT AGA 


Asn Thr Tyr Thr 


Ser Ser Gly 


Gin Phe Asn 


Tyr Leu 


Thr 


Trp 


lie Arg> 


6150 


6160 


6170 


6180 






6190 


* * * 


* * 


* 


* * 




* 


* 


ACT GGA AGC CCC 


AAG TGC TCT 


CCT TCA GAC 


CTA GAT 


TAC 


CTA 


AAA ATA 


Thr Gly Ser Pro 


Lys Cys Ser 


Pro Ser Asp 


Leu Asp 


Tyr 


Leu 


Lys Ile> 


6200 


6210 


6220 


6230 




6240 


* * 


* * 


* * 


* 


* 




* * 


AGT TTC ACT GAG 


AAA GGA AAA 


CAA GAA AAT 


ATC CTA 


AAA 


TOG 


GTA AAT 


Ser Phe Thr Glu 


Lys Gly Lys 


Gin Glu Asn 


lie Leu 


Lys 


Trp 


Val Asn> 



6250 


6260 


6270 




6280 




* * * 




* 


* 


* 


■ * 


* * 




GGT ATC TCT TOG GGA 


ATC 


GTA TAT 


TAT 


GGA 


GGC TOG 


GGT AAA CAA 


CCA 


Gly Met Ser Trp Gly 


Met 


Val Tyr 


Tyr 


Gly 


Gly Ser 


Gly Lys Gin 


Pro 


6290 6300 




6310 




6320 


6330 




* * * 


* 


* 


* 




* 


* * 


* 


GGC TCC ATT CTA ACT 


ATT 


CGC CTC 


AAA 


ATA 


AAC CAG 


CTC GAG CCT 


OCA 


Gly Ser lie Leu Thr 


lie 


Arg Leu 


Lys 


He 


Asn Gin 


Leu Glu Pro 


Pro 


6340 6350 




6360 






6370 


6380 




* * ★ 




* * 




* 


* 


* * 




ATC GCT ATA GGA CCA 


AAT 


AOG GTC 


TTC 


AOG 


GGT CAA 


AGA CCC CCA 


ACC 


Met Ala lie Gly Pro 


Asn 


Thr Val 


Leu 


Thr 


Gly Gin 


Arg Pro Pro 


Thr> 



6390 6400 6410 6420 6430 

*•* * ** * ** * * 

CAA GGA OCA GGA CCA TCC TCT AAC ATA ACT TCT GGA TCA GAC CCC ACT 
Gin Gly Pro Gly Pro Ser Ser Asn He Thr Ser Gly Ser Asp Pro Thr> 
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6440 6450 6460 6470 6480 (SEQ ID NO: 3) cont'd 



GAG TCT AAC AGC ACG ACT AAA ATG GGG GCA AAA CTT TTT AGC CTC ATC 
Glu Ser Asn Ser Thr Thr Lys Met Gly Ala Lys Leu Phe Ser Leu Ile> 

6490 6500 6510 6520 

* * ★ ★ ★ ★ * * * 

CAG GGA GCT TTT CAA GCT CTT AAC TCC ACG ACT CCA GAG GCT ACC TCT 
Gin Gly Ala Phe Gin Ala Leu Asn Ser Thr Thr Pro Glu Ala Thr Ser> 

6530 6540 6550 6560 6570 

★ * * * + * * ** ★ 

TCT TCT TGG CTA TGC TTA GCT TCG GGC CCA OCT TAC TAT GAA GGA ATG 
Ser Cys Trp Leu Cys Leu Ala Ser Gly Pro Pro Tyr Tyr Glu Gly Met> 



6580 


6590 




6600 






6610 




6620 




★ * 


* 




* * 




* 


* 


* 


* 




GCT AGA AGA 


GOG AAA 


TTC 


AAT GIG 


ACA 


AAA 


GAA CAT 


AGA 


GAC CAA 


TGC 


Ala Arg Arg 


Gly Lys 


Phe 


Asn Val 


Thr 


Lys 


Glu His 


Arg 


Asp Gin 


Cys> 



6630 


6640 




6650 


6660 




6670 


★ 


* * 


* 


* 




★ 


* 


* 


* 


★ 


ACA 


TGG GGA TCC 


CAA AAT 


AAG 


CTT 


ACC CTT 


ACT 


GAG 


GTT TCT 


GGA AAA 


Thr 


Trp Gly Ser 


Gin Asn 


Lys 


Leu 


Thr Leu 


Thr 


Glu 


Val Ser 


Gly Lys> 




.6680 


6690 






6700 




6710 


6720 


+ 


* 


* * 




★ 


★ 


* 




* 


★ * 


GGC 


ACC TCC ATA 


GGA AAG 


GTT 


CCC 


CCA TCC 


CAC 


CAA 


CAC CTT 


TCT AAC 


Gly 


Thr Cys lie 


Gly Lys 


Val 


Pro 


Pro Ser 


His 


Gin 


His Leu 


Cys Asn> 




6730 


6740 




6750 






6760 






> * 


* 


★ 




★ * 




* 


* 


* 


CAC 


ACT GAA GCC 


TTT AAT 


CAA 


ACC 


TCT GAG 


AGT 


CAA 


TAT CTC 


GTA CCT 


His 


Thr Glu Ala 


Phe Asn 


Gin 


Thr 


Ser Glu 


Ser 


Gin 


Tyr Leu 


Val Pro> 



6770 




6780 






6790 




6800 




6810 






* 




* * 




* 


* 


★ 


★ 




★ * 




+ 


GGT 


TAT 


GAC AGG 


TOG 


TOG 


GCA TCT 


AAT 


ACT GGA 


TTA 


ACC CCT 


TCT 


GTT 


Gly 


Tyr 


Asp Arg 


Trp 


Trp 


Ala Cys 


Asn 


Thr Gly 


Leu 


Thr Pro 


Cys 


Val> 



6820 




6830 




6840 






6850. 




6860 




★ 


★ 


* 




* * 




* 


★ 


★ 


* 




TCC ACC 


TTC 


GTT TTT 


AAC 


CAA ACT 


AAA 


GAT 


TTT TGC 


ATT 


ATC GTC 


CAA 


Ser Thr 


Leu 


Val Phe 


Asn 


Gin Thr 


Lys 


Asp 


Phe Cys 


lie 


Met Val 


Gln> 



6870 6880 6890 6900 6910 

******** ** 

ATT GTT CCC CGA GTC TAT TAC TAT CCC GAA AAA GCA ATC CTT GAT GAA 
He Val Pro Arg Val Tyr Tyr Tyr Pro Glu Lys Ala He Leu Asp Glu> 

6920 6930 6940 6950 6960 

******* * * * 

TAT GAC TAC AGA AAT CAT CGA CAA AAG AGA GAA CCC ATA TCT CTC ACA 
Tyr Asp Tyr Arg Asn His Arg Gin Lys Arg Glu Pro He Ser Leu Thr> 

6970 6980 6990 7000 

* ★* * • * * * * * 

CTT GCT GTC ATC CTC GGA CTT GGA GTC GCA GCA GGT GTA GGA ACA GGA 
Leu Ala Val Met Leu Gly Leu Gly Val Ala Ala Gly Val Gly Thr Gly> 
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7010 7020 7030 7040 

********** 

ACA GCT GCC CTG GTC AOG GGA CCA CAG CAG CTA GAA ACA GGA CTT AGT 
Thr Ala Ala Leu Val Thr Gly Pro Gin Gin Leu Glu Thr Gly Leu Ser> 

7060 7070 7080 7090 7100 



7050 (SEQ ID NO: TO cont'd 





* 


* 




* 


* 


* 




★ 




* 


★ 






AAC 
Asn 


CTA 
Leu 


CAT 
His 


CGA 
Arg 


ATT GTA ACA 
He Val Thr 


GAA GAT 
Glu Asp 


CTC 
Leu 


CAA GCC 
Gin Ala 


CTA 
Leu 


GAA 
Glu 


AAA TCT 
Lys Ser> 


7110 
★ * 




* 


7120 

* * 


7130 
* 




7140 
* * 




* 


7150 
* 


GTC 
Val 


AGT 
Ser 


AAC 
Asn 


CTC 
Leu 


GAG GAA TCC 
Glu Glu Ser 


CTA 
Leu 


ACC 
Thr 


TCC 
Ser 


TTA 
Leu 


TCT 
Ser 


GAA 
Glu 


GTA 
Val 


GTC CTA 
Val Leu> 


* 


7160 
* 




7170 
* * 


* 


7180 
* 


* 


7190 
* 




7200 
* * 


CAG 
Gin 


AAT 
Asn 


AGA 
Arg 


AGA 
Arg 


GGG 
Gly 


TTA GAT 
Leu Asp 


TTA 
Leu 


TTA TTT 
Leu Phe 


CTA 
Leu 


AAA 
Lys 


GAA 
Glu 


GGA 
Gly 


GGA TTA 
Gly Leu> 




* 


7210 
* 


★ 


7220 
* 




7230 
* * 




* 


7240 
★ 


* 


TGT 
. Cys 


GTA 
Val 


GCC TIG 
Ala Leu 


AAG 
Lys 


GAG GAA 
Glu Glu 


TCC 
Cys 


TCT 
Cys 


TTT 
Phe 


TAT 
Tyr 


GTC 
Val 


GAT CAT 
Asp His 


TCA GGG 
Ser Gly> 


7250 
* 




7260 
* * 




7270 
* * 


* 


7280 
* 




7290 
* * 


* 


GCC 
Ala 


ATC 
He 


AGA 
Arg 


GAC 
Asp 


TCC 
Ser 


ATC AAC AAG 
Met Asn Lys 


CTT 
Leu 


AGA 
Arg 


GAA 
Glu 


AGG 
Arg 


TTC 
Leu 


GAC 
Glu 


AAG CGT 
Lys Arg> 


7300 
* 


* 


7310 
* 


7320 
* * 




* 


7330 
* 


★ 


7340 
* 


OGA AGG 
Arg Arg 


GAA 
Glu 


AAG 
Lys 


GAA 
Glu 


ACT ACT 
Thr Thr 


CAA 
Gin 


GGG 
Gly 


TOG 
Trp 


TTT 
Phe 


GAG 
Glu 


GGA 
Gly 


TOG 
Trp 


TTC AAC 
Phe Asn> 


7350 
* * 




* 


7360 

* * 


7370 
* 




7380 
* * 




* 


7390 
* 


AGG 
Arg 


TCT 
Ser 


CTT 
Leu 


TGG 
Trp 


TTC GCT ACC 
Leu Ala Thr 


CTA 
Leu 


CTT 
Leu 


TCT 
Ser 


GCT 
Ala 


TTA 
Leu 


ACA 
Thr 


GGA 
Gly 


CCC TTA 
Pro Leu> 


* 


7400 
* 




7410 
* * 


* 


7420 
* 


* 


7430 
* 




7440 
* * 


ATA 
He 


GTC 
Val 


CTC 
Leu 


CTC 
Leu 


CTC 
Leu 


TTA CTC 
Leu Leu 


ACA 
Thr 


GTT GGG 
Val Gly 


OCA 
Pro 


TCT 
Cys 


ATT 
He 


ATT 
He 


AAC AAG 
Asn Lys> 




* 


7450 
* 


* 


7460 
* 




7470 
* * 




* 


7480 
* 


* 


TTA 
Leu 


ATT 
He 


GCC TTC 
Ala Phe 


ATT 
He 


AGA GAA 
Arg Glu 


CGA 
Arg 


ATA AGT 
He Ser 


GCA 
Ala 


GTC 
Val 


CAG ATC 
Gin He 


ATC GTA 
Met Val> 


7490 
* 




7500 
* * 




7510 
* * 


* 


7520 
* 




* 


7530 
* 




CTT 
Leu 


AGA 
Arg 


CAA CAG 
Gin Gin 


TAC 
Tyr 


CAA AGC CCG 
Gin Ser Pro 


TCT AGC AGG 
Ser Ser Arg 


GAA 
Glu 


GCT GGC 
Ala Gly 


CGC 
Arg> 



7540 7550 7560 7570 7580 7590 

** ** ** ** ** ** 

TAGCTCT ACCAGTTCTA AGATTAGAAC TATTAACAAG AGAAGAAGTC GGGAATCAAA 



FIGURE 3,CONT. 



Page 34 of 34 

MOLECULAR SEQUENCE OF SWINE RETROVIRUS AND METHODS OF USE 

Jay A. Fishman 
14846-011004 



7600 7610 7620 7630 7640 7650 (SEQ ID NO: 3) cont'd 

* * * * * * * * * * * * 

GGATCAAAAT ACAACCTAAG CTAATCAGAA GCTTAAAATT GTTCTGAATT CCAGAGTTTG 

7660 7670 7680 7690 .7700 7710 

* * ** * * . * * * * * * 

TTCCTTATAG GTAAAAGATT AGGTTTTTTG CTGTTTTAAA ATATGCQGAA GTAAAATAGG 

7720 7730 7740 7750 7760 7770 

* ★ * * * * * * * * * * 

CCCTGAGTAC ATGTCTCTAG GCATGAAACT TCTTGAAACT ATTTGAGATA ACAAGAAAAG 

7780 7790 7800 7810 7820 7830 

* * ** *★ *★ ** ★* 

GGAGTTTCTA ACIGCTTGTT TAGCTTCTGT AAAACTGGTT GCGCCATAAA GATGTTGAAA 

7840 7850 7860 7870 7880 7890 

*★ ** ** * * ★* ** 

TGTTGATACA CATATCTTGG TGACAACATG TCTCCCCCAC CCCGAAACAT GCGCAAATGT 

7900 7910 7920 7930 7940 7950 

********* *** 

GTAACTCTAA AACAATTTAA ATTAATTGGT CCACGAAGCG CGGGCTCTCG AAGTTTTAAA 

7960 7970 7980 7990 8000 8010 

** ** ** ** ** ** 

TTGACTGGTT TGTGATATTT TGAAATGATT GGTTTGTAAA GOGCGGQCTT TCTTGTGAAC 

8020 8030 8040 8050 8060 8070 

** ** ** ** ** ** 

CCCATAAAAG CTGTCCCGAC TCCACACTCG GGGCCGCAGT CCTCTACCCC TGOGTGGTGT 

8080 8090 8100 8110 8120 8130 

** ** **.*♦ ** ** 

AOGACTGTGG GCCCCAGCGC GCTTGGAATA AAAATCCTCT TGCTGTTTGC ATCAAAAAAA 
AA 
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